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CODON-OPTIMIZBD POLYNUCLEOTIDE-BASED VACCINES 

AGABiST BAcnxusAimiRACisiimcnoi^ 



BACKGROUND OF TEBB INVENTION 
Field of the laveation 

Historically, antbrax infection is associated vAUi herd animals and was 
not commonly seen as a human pathogen (Mock, M. and Fouet, A Annual 
Review of Microbiology 55:647-671(2001)). Therefore, it is not surprising 
that zoonotic Bacillus anthracis infection and pathogenesis in humans is not 
well characterized. However, anthrax has become a greater human disease 
problem with the realization that anthrax spores could be weaponized. It is 
now widely accq)ted that A anthracis spores can be inexpensively produced, 
are extremely stable when properly stored, and could be effectively distributed 
in populated areas. Consequently, B. anthracis becomes an ideal organism for 
use as a biological weapon and opens up the possibility of an intentional and 
major outbreak of infection in humans. R^earch during the past 10-15 years 
has provided an increasing amount of mformation about the molecular basis of 
disease in humans, providing the scientific basis &r developing specific 
diagnostics and defined subunit vaccines. 

Related Alt 

In addition to developing mc»e rapid and sensitive diagnostics, 
molecular biological methods enable the development of defined subunit 
vaccines to counter biotem)risnL Indeed, safe, effective recombinant subunit 
vaccines would significantly reduce, and peihqis eliminate, the need for 
therapeutic treatments. In the case of A anthracis, virulence is the results of a 
nndti--con]^xment toxin secrrted by the organism. The toxin consists of three 
separate gene products designated protective antigen (PA), lethal fector (LF) 
and edema &ctor (EF). The genes encoding these toxin components (pag, lej^ 
and cya, respectively) are located on a 184-ld) plasmid designated pXOl. 
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pXOl, along with a second plasmid, pX02 carrying cj?)siale gaies thought to 
protect bacilli fcom host cell phagocytosis, are required for full anthrax 
virulence and are carried by all vimlent strams ofB. anihrads (Mikesell, P., et 
al Infect bnmun. 39: 371-376 (1983)). PA (735 aa, MW 82,684) is a smgle 

5 chain protein which binds to a mammalian cell sor&ce receptor. Upon 

cleavage by furin (or a furin-like enzyme activi^), it is cleaved into a 63-kDa 
receptor-bound product (Leppla, S JBL., "Production and purification of anthrax 
toxin," in Methods in Emymology. S. Harshman» ed., Academic Press, Inc., 
Orlando, PL (1988), pp. 103-1 16; Klnnpel, KJL, et al, Proc Natl Acad. ScL 

10 (USA) 5^:10277-10281 (1992); Gordon, VAt, etal., Infect. Immun. 63:92-Z7 

(1995); Petosa, C, et al., Nature 355:8833-8838 (1997)). The 63-kDa PA 
fi:agment forms a heptameric complex on the mammalian cell surface vMch is 
capable of interacting imNfa tiie 90-kDa LP protein and the 89-kDa EF protein, 
^ch are sobsequenfly internalized (MOne^ J.C., et al, J. Biol Chem. 

15 2«):20607-20612 (1994); Petosa, C„ et al., Nature 355:8833-8838 (1997)). 

LP (776 aa, MW 90,237) is a zinc metalloprotease that cleaves several 
isofbrms of MAP kinase kinase (Mekl, Mek2, MKK3), thereby dimipting 
signal traxisduction events within the cell and eventually leading to cell death 
(Duesbery, N.S., et al, Science 250:734^737 (1998); Pellizari, et al, 

20 FEBS Ltrs 462:199-204. (1999)). The EF protein (767 aa, MW 88,808) is a 

calmodulin-dependent ad^ylate cyclase that causes deregulation of cellular 
physiology, leading to clinical manifestations that include ed^na (Leppla, 
SJa., Proa Nad. Acad. Sci. (USA) 7P:3162-3166 (1982)). The LF protein, 
whidi togeth^ with PA is referred to as lethal toxin (Letx), is considered 

25 responsible for the rq>id lethality of anthrax infection (Pannifer, A., et oL, 

Nature 414:229-232. (2001)). 

Protection against anthrax infection is associated with a hmnoral 
inmnme r^onse directed against PA (Ivins, BJB. and Welkos, Eur. X 
Epidemiol -/(1):12^19 (1988); Ivins, B., et al. Vaccine 13:1779-17^ (1995)). 

30 Some evidence soggests that BP and LF may also contribute to specific 

inoonunity (little, SJ?. andSnudson, Infect Immun. 52:509-512. (1986); 
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Ivins, BJB. and Welkos, SX,, Eur. J. Epidemiol -#(1): 12-19 (1988); Pezaid, C, 
et aL, Infect, Immim. (55:1369-1372 (1995)), although these components have 
not been fozmulated into a subumt vaccine. 

The cuirenl FDA-approved anthrax vaccine, Anthrax Vaccine 

S Adsoibed (AVA), is produced from the culture supCTatant fraction of die 

V770-NP1-R strain of 5. anthracis. Its principal con]^onent is the PA 
antigen adsorbed onto aluminum hydroxide. The production process is 
conoplex and the precise conoposition of the bacterial cell supernatant is not 
weU G^iaracterized. Consequmtly» there is a significant lot-to-lot variation, fii 

10 addition, the iqpproved vaccination regimen is less than optimal for compliance 

and convenience: AVA is administered subcutaneously in a 0.5 ml volume at 
0, 2» and 4 weeks and Aen again at 6, 12, and 18 months. Annual boosts are 
also required. 

Recently tiiere has been a report of potential safety concerns in 

15 pregnant women, although Ihe causal reladonshq) has not been well 

established. As a result of these and ottier lay press reports, there is a negative 
public perception about the reliability and quaUty of the AVA ^s^icdne even 
though the actual safety of the vaccine has never been seriously questioned in 
the sciefutific literature. A major concern with the current AVA anthrax 

20 vaccine is the paucity of analytical characterization of the actual composition 

of the vaccine prqiaratioit It has been suggested that the presence of minute 
amounts of unspecified con^onents may contribute to the adverse events that 
have been associated with administration of the AVA vaccine. In contrast, 
DNA vaccines are d^gned to elicit innnunity against discrete, well-defined 

25 target antigens and are unlikely to be the subject of the same criticism. In 

short, DNA vacdnes can be nmltxvalent and yet hig^y defined. 

Daring tiie past few years there has been substantial interest in testing 
DNA-based vacdnes for a numb^ of infectious diseases where the need for a 
vaccine, or an inq>roved vacdne, exists. Several weU-recognized advantages 

30 of DNA-based vacdnes include tiie speed, ease and cost of manu&cture, the 

versatility of developing and testing multivalent vacdnes, the finding that 
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DNA vaccines can produce a robust cellular response in a wide variety of 
animal models as well as in man, and the proven safety of using plasmid DNA 
as a delivery VK*>r (Donnelly, JJ,, et al., Anntu Rev, Immunol 75:617-648 
(1997); Manickan, E., et al, Crit. Rev. Immunol i7(2):139-154 (1997)). 
DNA vaccines represent the next generation in the development of vaccines 
(Nossal, a, Nat, Med 4(5 Supple):47S-476 (1998)) and numerous DNA 
vaccines are in clinical trials. 

DNA-based immunization have already beoi shown, in animal models, 
to protect against a lethal challenge of anthrax toxin. The initial published 
work indicated that a plasmid encoding the protease-cleaved fiagmeot (PAes) 
of PA (Gu, et al,. Vaccine J7:340-344 (1999)) elicited ittotedive immunity 
against a lethal toxin challenge. Price, et al.. Infection and Immunity 69:4509' 
4515 (2001) extended these observations and demonstrated that DNA-based 
immunization with a ftagment of the LP gene product would also contribute to 
or provide protection against a lethal toxin challenge. Having established 
proof of principle in preclinical studies, we now propose to develc^ an 
aggr^sive product development plan ihsX will lead to an efficacious human 
vaccine against B. anihracis using a DNA-based immunization strategy. 

Retooling coding regions encoding polypeptides from pathogens using 
codon firequencies preferred in a given mammalian species often results in a 
significant increase in expression in ttie cells of that mammaUan species, and 
concomitant increase in immunogenicity. See, e,g.^ Demi, L., et oL, J, Virol 
75:10991-11001 (2001), and Narum, DL, et al, Irtfect. Immun. 5P:7250-7253 
(2001). 

There remains a need in the art for convenient, safe, and ef&cacious 
immunogiBmc confounds to protect vertebrates against BadUus anihracis 
infection. The present invention provides safe yet effective immunogenic 
compounds and methods to protect vertebrates against BadUus anihracis 
infection usmg such immuno|^c compounds. 
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SUMMARY OF THE INVB^ 

The ptesent inveodon is directed to enhancmg immune response of a 
vibrate in need of ptotection against anOirax infection by administenng in 
VIVO, into a tissue of a vertebrate^ a polynucleotide conqirisiiig a codcm- 
optimized coding region encoding a component of Bacillus antkrads lethal 
toxin or nucleic acid fragments of such coding regions encoding fragments, 
variants, or derivatives thereof Nucleic acid fragments of the present 
invotion are altered from ttieir native state in one or more of the following 
ways* First, a nucleic add fragmmt which aicodes a conq>onent of the A 
antkrads lethal toxin may be part or all of a codon-optimized coding region^ 
optimized according to codon usage in a giv^ species, eg,, a vertebrate 
Species, e.g., a mammalian species, e.g,y humans. In addition, a nucleic acid 
fragment which encodes a conq)oneDt of the B, antkrads lethal toxin may be a 
firagment which encodes only a portion of a full-length polypeptide, and/or 
may be mutated so as to, for example, remove from the encoded polypeptide 
adventitious protein moti& present in the encoded polypeptide or virulence 
Actors associated witii the encoded polypqitide. For example, the nucleic 
add sequence could be mutated so as not to encode adveartitious N-linked 
glycosylation moti& (Nf-X-(S or T), where X is any ammo add). The 
polynucleotides are incorporated into the cells of the vertebrate in vrvp, and a 
prophylactically or tiierapeutically effective amount of a Badllus antkrads 
lethal toxin ccmq>onent is produced in vivo. 

The invention furOier provides immunogenic con^ositions conqnising 
a polynucleotide v^ch ccmptises one or more codon-optimized coding 
regions encoding conq>orient8 of Badllus antkrads leflial toxin or nucldc add 
firagments of such coding regions encoding fragments, variants, or derivatives 
thereof, and methods for pnhnndng the iommne response of a vertebrate to 
Badllus antkrads infection by administering to the tissues of a vertebrate one 
or more polynucleotides conq)rising one or more codon-optimized coding 
regions encoding components of Badllus antkrads lethal toxin or nucldc acid 
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fi:agineDt8 of such oodhig regions encoding fiagm^ts, vanants» or derivatives 
thereof. The present invention furtiier provides plasmids and othw 
polynucleotide constructs for delivery of nucleic acid coding sequences to a 
vertebrate which provide e^cpression of BaciUus antkracis toxin components^ 
5 or fiagmmts, variante, or derivatives thereof. 

In certain embodiments, the invention further provides methods for 
enhancing the immune response of a vertebrate to Bacillus antkracis infection 
hy sequentially administering two or more different unmunogemc 
CQUDposilions to the tissues of the vertebrate. Such methods conQsrise initially 

10 administerixig one or more polynucleotides comprising one or more codon- 

optimized coding regions encoding continents of Bacillus anthmeis letiial 
toxin or nucleic acid fragments of such coding regions encoding fiagmrats, 
variants^ or derivatives fh^eo^ to prime imnumity» and then administering 
subsequently a different vaccine composition, for example a recombinant viral 

IS vacdne, a protein subunit vaccine^ or a recombinant or 1^ 

or vaccines to boost die ZEiirBaciUus antkracis toxin immune response in die 
verteihrate. 

BRIEF DESCRIPnOK OF THE DRAWINGS 

Figure 1 shows fiie nucleotide sequence (SBQ ID N0:1) and amino 
20 add translation (SEQ ID N0:2) of TPA-PA63. SEQ ID NOil contains a 

nucleic acid fiagment of a human codon-optimized PA coding region, 
encoding the 63kD fnrin cleavage product of the Bacillus antkracis protective 
antigen (PAX fused to a nucleic acid encoding the human tissue plasminogen 
activator (TPA) signal peptide sequence. Nucleotides 1-12 of SEQ IDNO:l is 
25 a Kozak translation initiation element and nucleotides 13-81 of SEQ ID N0:1 

encode the TPA signal peptide. Nucleotides 82-1782 of SEQ ID N0:1 encode 
tiie 63kD furin processed firagment of PA that can bind LP and EE, and 
hi^tamerize and form a pore in infected cells Ihrough which die toxin is 
delivered. The 63kD fiirin processed fragment of PA correqKinds to amino 



wo 2004/024067 



PCT/US2003/028199 



adds 199-764 of the native full-length PA amino acid sequence of GenBank 
accession No, AAA2263 (SEQ ID N0:4) encoded by GeoBank accession 
nmnbcr M22589 (SEQ ID NO:3), 

Figure 2 shows the nucleotide sequence (SEQ ID NO:S) and amino 

5 add translation (SEQ ID N0:6) of TPA-PA63AF313-314. SEQ ID N0:5 is 

idmtical to SEQ ID N0:1, except that the nucleotides encoding the two 
phenylalanine residues at amino adds 313-314 of SEQ ED N0:2 are deleted, 
whidi results in a PA protein that cannot form the pore through which LP and 
EP are translocated. Nudeotides 1-12 of SEQ ID NO:S is a Kozak translation 

10 initiation element and nudeotides 13-81 of SEQ ID N0:5 encode the TPA 

signal peptide. 

Figure 3 shows the nucleotide sequence (SEQ ID N0:7) and amino 
add translation (SEQ ID NO:8) encoding TPA-PA83 A Furia SEQ ID N0:7 
contains a nucldc add figment of a human codon-optimized PA coding 

15 region, encoding fuU-leogth mature PA (amino adds 30-764 of SEQ ID N0:4) 

with the fuiin cleavage site deleted (SRKKRS, amino adds 192-197 of SEQ 
ID N0:4). This mutant PA cannot be processed to the 63 kD fragmmt and 
cannot bind LF or EF. Nucleotides 1-12 of SEQ ID N0:7 is a Kozak 
translation initiation elraient and nucleotides 13-81 of SEQ ID NO:7 encode 

20 the IPA signal peptide. 

Figure 4 shows the nucleotide sequence (SEQ ID N0:9) and amino 
add translation (SEQ ID NO:10) of TPA-LF HEXXH 
CH686A+H690A+E687D). SEQ ID N0:9 contains a nucleic acid ftagment of 
a human codon-optimized LF coding region, encoding the mature Bacillus 

25 anthrads lethal &ctor with three inactivating point mutations. Eitiier the 

H686A + H690A (decreased Zn bmding and no protease activity) or E687D 
(no protease activity, no in vitro or in vivo macrophage killing) mutation 
inactivates flie enzymatic activity of LF rendering it non-toxic (Hammond S JB. 
and Hanna P.C. Irtfect Jmmun. 55:2374-2378(1998)). This construct 

30 combmes botti sets of mutations. Nucleotides 1-12 of SEQ ID N0:9 is a 

Kozak translation initiation dement and nucleotides 13-81 of SEQ ID N0:9 
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encode the TPA signal peptide. Nucleotides 82- 2412 encode a nontoxic 
fonn of leflial factor. TPA-LF HBXXH (H686A+H690A+E687D) is derived 
fiom the native faU-loigfli LF amino acid sequaice of GenBasak accession No. 
AAA22569 (SEQ ID N0:12) mcoded by GenBank accession number 
M30210(SEQIDNO:11). 

Figure 5 shows the nucleotide sequence (SEQ ID NO:13) and amino 
add translation (SEQ ID N0:14) of TPA-LF Domain HH. SEQ ID N0:13 
contains a nucleic acid fragment of a human codon-optbnized LP coding 
region, encoding an N-temiinal fiagmeot (domains I-IU) of LF corresponding 
to amino adds 34-583 of SEQ ID N0:12^ Nucleotides 1-12 of SEQ ID N0:13 
is a Kozak translation initiation element and nucleotides 13-81 of SEQ ID 
N0:13 encode the TPA sigqal peptide. Nucleotides 824734 of SEQ ID NO:13 
encode domains I-m of LF. The entire protease domain (doniain IV) has been 
deleted. 

Figure 6 shows the nucleotide sequence (SEQ ID N0:15) and amino 
add translation (SEQ ID NO:16) of TPA-LF Domain lA SEQ ID N0:15 
contains a nucldc add fiagment of a human codon-optimized LF coding 
region, encoding an LF N-temiinal jftagment of LF corresponding to amino 
acids 34-254 of SEQ ID NO: 12, Tliis truncated version of LF roughly 
corresponds to the domain I portion of LF that directly binds PA63. Pannifer 
A.D. et al. Nature 414:229-333 (2001). Nucleotides M2 of SEQ ID NO:15 is 
a Kozak translation initiation elemaat and nucleotides 13-81 of SEQ ID 
N0:15 encode the TPA signal peptide- Nucleotides 82-747 of SEQ ID N0:15 
encode domain I of LF. 

Figure 7 shows the nucleotide sequence (SEQ ID NO:17) and amino 
add translation (SEQ ID NO:18) of TPA-PA63 with the N-linked 
glycosylation moti& mutated SEQ ID NO:17 is identical to SEQ ID NO:l, 
except that all ten N-linked glycos^ation sites have been mutated. The N 
reddue m the glycosylation motif (N-X-S/T) has been dia^ 
(Q-X-S/T) resulting in a protem that cannot glycos:^ated at these dtes. 
Nucleotides 1-12 of SEQ ID N0:17 is a Kozak translation initiation element 
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and nucleotides 13-81 of SEQ ID N0:17 encode flie TPA signal peptide. 
Nucleotides 82-747 of SEQ ID N0:15 encode domain I of LF. Nucla)tide8 
82-1782 of SEQ ID NO: 17 encode a mutated fom of tiie 631cD fiixin 
processed fragmeaat of PA that can heptamerize, bind LF and EF, and form a 
pore in infected cells thiou^ which the toxin is delivered. 

Figure 8 shows the nucleotide sequence (SEQ ID N0:19) and amino 
acid translation (SEQ ID NO:20) of sugar-minus TPA-LF HEXXH mutant 
(H686A+H690A+E687D). SEQ ID N0:19 is identical to SEQ ID N0:9. 
excq>t that all sevmN-linked glycosylation sites have been mutated. Ihe N 
residue in Ifae glycosylation motif (N-X-S/T) has been changed to a Q residue 
(Q-X-S/T) resulting in a i»x>tdn that cannot be glycosylated at &6se sites. 
Nucleotides 1-12 of SEQ ID NO: 19 is a Kozak translation initiation element 
and nucleotides 13-81 of SEQ ID N0:19 mcode tiie TPA signal peptide. 
Nucleotides 82- 2412 encode a non-toxic form of leOial fictor which cannot be 
glycosylated. 

Figure 9 shows a nucleotide sequence comparison of a nucldc acid 
fragment of a human codon-optimized PA coding region, encoding PA63 
(nucleotides 82-1782 of SEQ ID N0:1) vs. the native nucleotide sequoKse of 
Bacittus anthracis PA63 (nucleotides 2398^95 of SEQ ID N0:3). 
Differences between the two sequences are denoted with a letter. There is 
approximately 25% difference in the two coding sequences- 
Figure 10 shows a nucleotide sequence conq>arison of a humani zed 
nucleotide sequence encoding flie mature PA A finin (nucleotides 82-2268 of 
SEQ ID NO: 7) vs. the native nucleotide sequence of Bacittus anthracis 
mature PA (nucleotides 1891-4095 of SEQ ID N0:3). Differences betwem the 
two sequences are denoted with a letter and g^s are denoted as a dash. There 
is proximately 25% differmce in the two coding sequences. 

Figure 11 shows a nucleotide sequence conoparison of a hum an i zed 
nucleotide sequence encoding the matore LF A HEXXH (nucleotides 82-2409 
of SEQ ID N0:9) vs. the native nucleotide sequence of BaeiUus antkrads 
mature LF (nucleotides 784-3111 of SEQ ID N0:11). Differences between 



"WO 2004/024067 



PCTAJS2003/028199 



-10- 

the two sequences are deaoted with a letter and gaps aie denoted by a gap. 
There is approximately 25% diflference in the two coding sequaices. 

Figure 12 shows an amino acid con^)aiison between 1PA-PA63 (SBQ 
ID N0:2) and sugar minus TPA-PA63 (SEQ ID N0:18). All ten N-linked 
5 glycosylation sites N-X-S/T in TPA-PA63 have been mutated to Q-X-S/T so 

that they will not be a substcate fisr giycos)4attoiL 

Figure 13 shows an amino acid comparison between TPA- 
LFAHBXXH (SEQ ID NO:10) and sugar minus TPA-LFAHBXXH (SBQ ID 
NO:20). All seven N-linked glycosylation sites N-X-S/T in TPA-PA63 have 
1 0 been mutated to C^X-S/T so that they will not be a substrate for glycdsylation. 

Figure 14 shows Ihe nooleotide sequence (SBQ ID NO:39) and amino 
acid translation (SBQ ID NO:40) of TPA-LF Domain IB. SBQ ID NO:39 
contains a micleic acid fiagment of a human codon-optimized LP coding 
region^ encoding an LF N-taminal fiagment of LF correspondmg to amino 
15 acids 34-295 of SBQ ID N0:12. This truncated version of LF rougjily 

corresponds to Ae domain I portion of LF that directly binds PA63. Panni£sr 
AJy. et al Nature 414:229-333 (2001), Nucleotides 1-12 of SEQ ID NO:39 is 
a Kozak translation initiation element and nucleotides 13-81 of SEQ ID 
NO:39 encode the TPA signal peptide. Nucleotides 82-870 of SBQ ID NO:39 
20 encode domain I of LF. 

Figure 15: Antibody titers measured in mouse immunization 
experiment 1 (Example 11). 15A: protective antigen (PA) titers; 15B: lethal 
factor (LF) titers; and 1 SC: lethal toxin (LT) neutralization titers. 

Figure 16: Antibody titers measured in mouse immunization 
25 experimmt 2 (Bxample 11). 16A: protective antigm (PA) titets; 16B: Idiial 

&ctor (LF) titers; and 16C: leOial toxin QLT) neutralization titers. 

Figure 17: Antibody titers measured in mouse immunization 
experimmt 3 (Bxanqple 11). 17A: protective antigen (PA) titers; 17B: lethal 
fector (LP) titers; and 17C: lethal toxin (LT) neutralization titers* 
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Figuie 18: Antibody titers measured in mouse immunization 
experiment 4 (Example 11). 18A: protective antigen (PA) titers; 185B: lethal 
toxin (LT) neutralization titers. 

Figme 19: Pre-challenge lethal toxin (LT) neotralizatian titers in the 
rabbit immunization e3q}€rimetit (Example 12). 

Figure 20: Antibody titers measured in mouse immunization 
experiment S (Bxanq>le 11). 

Figure 21: Ledial toxin (LT) neutralization titers in mouse 
immunization e?q)erimeat S Sample 1 1). 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention is directed to conq)ositions and methods for 
enhancing the immune response of a vertebrate in need of protection against 
BacUIus mtkracis infection by adnrnnstmng in vivo^ into a tissue of a 
vertebrate a polynucleotide compiistag a human codon-optimized coding 
regLcm encoding a polypeptide of Bacillus anthrads^ or a nucleic acid 
firagmrat of such a coding region encoding a firagmeol, variant, or derivative 
thereof. The polynucleotides are incorporated into the cells of the vertebrate in 
vivo, and an immunologically effective amount of the Bacillus anthrads 
polypeptide, or fiugment or miant is produced in vivo. 

The present invention provides polynucleotide-based vaccines and 
mdhods for delivery of BaciUus anthrads coding sequences to a vertebrate 
with optimal expression and safety confened through codon optimization 
and/or other manipulations. These polynucleotide-based vaccines are prepared 
and administered in such a manner that tibie encoded gene products are 
optimally expressed in the particular vertebmte to ^^ch the composition is 
administered. As a result^ these compositions and meHiods are useful in 
stimulating an hnmune response against Bacillus anthrads infection as the 
coding sequence encodes a polypeptide which stimulates the immune system 
to respond to anthrax infection. Also included in the invention are eaqxression 
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systems, delivery systems, and «)doiirOptimized Bacillus anthrads coding 
sequences. 

A polynucleotide vaccine of the present invention is enable of 
eliciting, without more, an immune re^onse in a vertebrate against A 
anthrads when administered to tiiat vertdrate. Such polynucleotides are 
lefetred to herein as polynucleotide vaccines. 

It is to be noted that the tran "a** or "an" entity refOT to one or more of 
that entity; for exan:qpl^ "a polynucleotide," is understood to rqxresent one or 
more polynucleotides. As such, Ihe teams V (or W), "one or more." and •'at 
least one" can be used interchangeably harem. 

The teims "nucleic add" or ''nucleic add fragment" refers to any one 
or more nucldc add segments, e.g,, DNA or KNA fragments, present in a 
polynucleotide or construct. While tiie tenns "hmcldc acid," as used herein^ is 
meant to include any nucldc add, the term "nucldc add firagment^ is used 
herein to apedficaUy denote a fragmrat of a designed or synthetic codon- 
optimized coding region encoding a polyp^tide, or fragment varian:^ or 
derivative thereof, which has hem optimized according to the codon usage of 
a given specie. As used herein, a "coding region" is a portion of nucldc add 
which consists of codons translated into amino adds. Although a "stop codon" 
(TAG, TGA, or TAA) is not translated into an amino add, it may be 
considered to he part of a coding region, but any flanking sequences, for 
exanq>le promoters, ribosome binding sites, transcriptional terminators, and 
tiie like, are not part of a coding region. Two or more nucldc adds of the 
present invention can be present in a single polynucleotide construct, e.g., on a 
single plasmid, or in separate polynucleotide constructs, eg., on s^arate 
plasmids* Furtiiermore, any nucldc add or nucleic add fiagment noay encode 
a smgle polypqrtide, e.g.^ a single antigen, cytokine, or regulatory 
polypeptide, or may encode more than one polypeptide, eg., a nucldc add 
may encode two or more polypeptides. In addition, a nucldc add may encode 
a regulatory element sudi as a pron:iot6r or a transcrqition tamunator, or may 
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encode a specialized element or motif of a polypeptide or protein, such as a 
secretory signal peptide or a functional domain. 

The tenns "fiagment," Variant," "derivative" and "analog" when 
referring to B. arUhracis polypeptides of the present invention include any 
polypeptides v^ch retain at least some of the immunogenicity or antigenicity 
of the corresponding native polyp^tide. Fragments of B, anihracis 
polypeptides of the present invention include proteolytic firagments, deletion 
fragments and in particulari fragments of B. anthrads polypeptides 
exhibit reduced pathogenicity when delivered to an animaL Polypeptide 
firagmmts further include any portion of the polypeptide v^iich CQmprises an 
antigenic or immunogenic epitope of (he native polypeptide^ including linear 
as well as thxee-demensional q)itopes. Variants of A anthrads polyp^tides 
of the present invention includes fiagments as described above^ and also 
polypq)tidjes with altered amino add sequences due to amino add 
substitutions^ del^ons, or insertions. Variants may occur naturally, sudi as an 
allelic variant By an "allelic variant" is intended alternate forms of a gene 
occupying a given locus on a chromosome of an organism. Genes Lewin, 
B,, ed., John Wiley & Sons, New York (1985). Non-natuially occurring 
variants may be produced using art-known mutagenesis techniques. Variant 
polypeptides may comprise conservative or non-conservative amino acid 
substitutions, deletions or additions. Derivatives of A anihracis polypeptides 
of the present invention, are polypeptides which have been altered so as to 
exhibit additional features not found on the native polypeptide. Examples 
include fusion proteins. An analog is another form of a A anthrads 
polypeptide of the present invention. An example is a proprotein (eg., A 
anthrads PA83) which can be activated by cleavage of the proprotein to 
produce an active mature polypq>tide {e.g., A anthrads PA63). 

The tern ''polynucleotide" is intended to encompass a singular nucldc 
add or nucldc acid fiagment as well as plural nucldc adds or rmcldc add 
fragDxents, and refers to an isolated molecule or construct, e.g. , a virus genome 
(e.g.j a non-infectious viral genome), messenger RNA (mRNA), plasmid DNA 



wo 2004/024067 



PCTAJS2003/028199 



-14- 

(pDNA), or derivatives of pDNA (eg., minicircles as described in parquet, 
A-M et al. Gene Therapy 1341-1349 (1997)) comprisiiig a polyaucleotide. 
A nucleic acid may be provided in linear {e,g,, mRNA), dicular (eg., 
plasmid), or branched form as well as double-stranded or single-stranded 
fonns. A polynucleotide may ccmqirise a conventional phosphodiester bond 
or a non-convaitional bond (ag., an amide bond, such as found in peptide 
nncldc adds (PNA)). 

The tmns "infectious polynucleotide" or "infectious nncldc add'* are 
intended to encompass isolated viral polynucleotides and/or nucldc adds 
which are solely sufficient to mediate the synthesis of conq)lete infectious 
virus particles upon iiptake by permissive cells. "Isolated" means that die viral 
nucldc add does not require pre-syuthesized copies of any of the polyp^tides 
it encodes^ e.g., viral r^licases, in order to initiate its replicalion cycle. 

The temis "non-infectious polynucleotide^ or "^on-infectious nucldc 
add" z& defined herdn which cannot, without additional added materials, e.g, 
polypeptides, mediate fte synthesis of conq^lete infectious virus particles upon 
uptake by permissive cells. An infectious polyimcleotide or nucldc add is not 
made "non-infectious'' simply because it is taken vqi by a non-permissive cell. 
For example, an infectious vital polynucleotide fiom a virus with limited host 
range is infectious if it is capable of mediating the synthesis of conqil^e 
infectious virus particles when taken up by cells derived from a permissive 
host (i.e., a host permissive for the virus itself). The fact that uptake by cells 
derived fiom a non-permisdve host does not r^ult in the synthesis of 
complete infectious virus particles does not make the uucleic acid ''non- 
infectious.'* hi other words, the torn is not qualified by the nature of the host 
cdl, the tissue type, or the 4)edes. 

hi some cases, an isolated infectious polynucleotide or nucleic add 
m^ produce fiiUy-infectious virus particles in a host cell population which 
lades receptors for the virus particles, Le, is nourpermissive for the virus 
it6d£ Thus viruses produced will iiot infect surrounding cells. However, if 
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the supernatant contammg the virus particles is transferred to cells yMch are 
pecmissive for the vims, mfection will take place. 

The terms "replicating polynucleotide" or ''rq>ficatmg nucleic arid" are 
meant to encompass those polynucleotides and/or nucleic adds which, xsporx 
being taken iqp by a penniaave host cell, are cq)able of prodncii^ multiple, 
eg., one or more copies of the same polynucleotide or nucleic add. infectious 
polynucleotides and nucldc adds are a subset of repKcating polynucleotides 
and nucldc adds; the tenns are not synonymous. For example^ a defective 
virus BSDoms laddng llie gcaies for virus coat protdns may rqplicate, e.g., 
produce multiple copies of itsell^ but is NOT infectious because it is inc^>able 
of tnflHintiTig the syzi&esis of coaxpUto mfectious virus particles unless the 
coat proteins, or another nucldc add encoding the coat proteins, are provided. 

In catain embodiments, the polynucleotide, nucldc acid, or nucldc 
add fragment is DNA. In ftie case of DNA, a polynucleotide oonqnising a 
nucldc add which encodes apoiypq>tide normally also comprises a promoter 
operably associated with the polypqptide'-encodjng nucldc add. An operable 
assodation is when a nucldc add encoding a gene product; e.g;, a 
polypeptide, is assodated with one or more regulatory sequences in such a 
way as to place expression of the gene product U3Dder the influence or control 
of the regulatory sequence(s). Two DNA fragments (such as a polypq>tide- 
encoding nucldc acid and a promoter associated with the 5' end of the nucldc 
add) are "operably assodated" if induction of promoter frinction results in the 
transcription of mRNA encoding the desired gene product and if the nature of 
the linkage between the two DNA fragments does not (1) result in the 
inboduction of a frame-shift mutation, (2) interfere with the ability of the 
expression regulatory sequences to direct the expression of the gene product, 
or (3) mterfere with the ability of the DNA tenq>late to be transcribed. Thus, a 
promoter region would be operably assodated with a nucldc add encoding a 
polypeptide if the promoter was capable of effecting transcription of that 
nucldc add. The promoter may be a cell-specific promoter that directs 
substantial transcription of the DNA only in predetemoined cdls. Other 
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Iransaription control elemeots, besides a promoter, fi)r exa]XQ>le enhaocerB, 
operators, repressors, and transcription teonination signals, can be operably 
associated with the polynucleotide to direct cell-specific transcription. 
Suitable promoters and other transcription control regions are disclosed herein. 

5 A variety of transcription control regions are known to those skilled in 

the art These include, without limitation, transcription control regions vMcb 
fimction in vertebrate cells, such as, but not limited to, promoter and enhancer 
segments ficom cytomegaloviruses (the immediate early promoter, in 
corgunction with intton-A), simian virus 40 (the early promote:), retroviruses 

10 (sudi as Rous sarcoma virus), and picomavinises (particularly an intemal 

ribosome entry site, or IRBS, also referred to as a CTIB sequence)* Other 
transcription control regions include those derived fiom vertebrate genes such 
as actin, heat shock protein, bovine g^>wth hormone and rabbit P-globin, as 
well as other sequences c£^able of controlling gene expression in eukaryotic 

15 cells. Additional suitable transcrq>tion control regions include tissue^ecific 

promoters and enhancers as well as lymphokine-inducible promoters (e.;., 
promoters inducible by inteaf oons or mterleukins). 

In one embodiment, a DNA polynucleotide of tiie present invention is 
a circular or linearized plasmid, or other Knear DNA winch is, in certain 

20 embodiments, non-infectious and nonintegrating (ie., does not integrate into 

the genome of vertd>rate cells). A linearized plasmid is a plasmid that was 
previously circular but has been Hnearized, &r example, by digestion with a 
restriction endonuclease. 

Alternatively, DNA virus genomes may be used to adinitiistcr DNA 

25 polynucleotides into vertebrate cells. In certain embodiments, a DNA virus 

geoome of the present invention is noninfectious, and nonintegrating. Suitable 
DNA virus genomes include herpesvmis gnomes, adenovirus genomes, 
admo-assodated virus genomes, and poxvirus genomes. Reference citing 
methods for the in vivo introduction of non-infectious virus genomes to 

30 vertebrate tissues are well known to those of ordmary skill in tiie art, and are 

cited iSi^pra. 
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In other embodimeiits, a polynucleotide of the present invrntion is 
RNA. Ill a suitable embodiment, tiie RNA is in tte fonn of messenger KNA 
(mKNA). Mefhods for introducing RNA sequences into vertebrate cells are 
desaibed in U.S. Patent No. 5,580,859, the disclosure of which is 
incorporated herein by reference in its entirety. 

Polynucleotide^ nucleic acids, and nucleic acid fragments of the 
present invention may be associated with additional nucleic adds v^ch 
mcode secretory or signal peptides, which direct the secretion of a 
polypeptide racoded by a nucleic add or polynucleotide of the present 
invention. AccraJing to the signal hypothesis^ proteins secreted by 
mammMm cells havd a signal peptide or secretory leader sequence which is 
cleaved from the mature protein once export of the growing protein chain 
across the rough endoplasmic reticulum has been initiated. Those of ordiaazy 
skill in the art are aware that polypeptides secreted by vertebrate cells 
generally have a signal peptide fiised to the N-terminus of the polypeptide^ 
which is cleaved from the coniplete or "full lenglh" polypeptide to produce a 
secreted or •^mature" form of the polypeptide. In certain embodiments, the 
native leader sequence is used, or a functional derivative of that sequence tiiat 
retains the ability to direct the secretion of the polypeptide that is operably 
associated with it Alternatively, a heterologous mammalian leader sequence, 
or a functional derivative thereof may be used. For example, the wild-type 
leader sequence may be substituted mlh the leader sequence of human tissue 
plaaminogoa activator (IPA) or mouse ^-glucuronidase. 

In accordance with one aspect of the present invention, diere is 
provided aplasmid for expression of zBacUltds anthracis PA or LF-derived 
coding sequence optimized for esqiresaion in the particular vertebrate ^>ecies 
to be treated or immmiized, Whm such a plasmid is delivered, in vivo to a 
Ussae of the vertebrate to be treated or immunised, the transcrq^tional unit will 
thus e3q)ress fhe encoded gene product The level of expression of the gene 
pioduct will depend to a significant extent on fhe strength of the associated 
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promoter and &6 presence and activation of an associated enhancer element 
as well as the optimization of the coding region. 

As used herein, the term '"plasmid" refers to a construct made \ip of 
genetic material (la^ nucleic acids). Typically a pla^d contains an origin of 
replication which is functional in bacterial host cells, ^g-f Eschericha coUy and 
selectable markers far detecting bacterial host cells comprising the plasmid 
Plasmids of the pr^ent invention may inchuie genetic elements as described 
herein arranged such that an inserted coding sequence can be transcribed in 
eukaryotic cells. Also, while tiie plasmid may indude a sequence fiom a viral 
nucleic add, sudi viral sequence normally does not cause the uicoiporati(»i of 
the plasmid into a viral particle, and the plasmid is thearefisre a non-viral 
vector. In certain embodiments described hoein, a plasmid is a dosed drcular 
DNAmoleciila 

The term "expression" refers to the biological production of a product 
encoded by a coding sequence. In most cases a DNA sequence, including the 
coding sequence, is transcribed to form a mess«ger-SNA (mKNA). Tb& 
messeoger-RNA is translated to form a polypeptide product which has a 
relevant biological activity. Also, the process of eTcpression may involve 
fiutber processing stq)s to the KNA product of transcription, such as splicing 
to remove introns, and/or post-translational processing of a polypeptide 
product 

As used herein, the temi "polypeptide'^ is intended to encompass a 
singular ^^polypeptide" as well as plural ^^polypeptides," oxad comprises any 
chain or chains of two or more mono acids. Thus, as used herein, tenns 
including, but not limited to "peptide," "dipqptide," '*tripeptide," "protein," 
'^amino acidchain,'^ or any otbo: termusedto lefer to a chain or chains of two 
or more amino acids, are included in the definition of a *^lypeptide>" and the 
team ^lypqitid^* may be used mstead of, or interchangeably with any of 
these terms. The term further iududes polypeptides whidi have undergone 
post-translational modifications, for example^ glycosylation, acetylation, 
phosphorylation, amidation, derivatization by known protecting/blocking 
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groiq)S9 proteolytic cleavage^ or modification by son-natuiaUy occucriiig 
anuno acids. 

Also included as polypeptides of the pr^ent iaveaition are j&agcients, 
denvatives, analogs, or vanants of the foregoing polypeptides, and any 
combination thereof. Po]ypq>tides, and fiagments, derivatives, analogs, or 
variants thereof of the present invention can be antigenic and immunogenic 
polypeptides related to B. anthracis polypeptides, which are used to prevent or 
treat, Le.^ cure^ ameliorate, lessen tiie sevedty of; or prevent or reduce 
contagion of mfectious disease caused by A anthracis . 

As i&ed herein, an antigenic polypeptide or an immuno^nic 
polypq>tide ]s a polypq>tide which, when introduced uito a vertebrate, reacts 
with the immune system molecules of the v^ebrate, le,^ is antigenic, and/or 
induces an immune response in the vertebrate, Le,^ is inomunogaiic. It is quite 
likely that an immunog^c polypeptide will also be antigenic, but an antigenic 
jpolypq)tide, because of its size or conformation, may not necessarily be 
immuno^iic. Examples of aiitigemc and immunog^c polypeptides of the 
present inventicm include, but are not limited to, A anthracis protective 
antigpn (PA) or lethal factor (LF)i fragments thereof e.g., PA63, LP domains 
Un or domain I, variants thereof, e,g., PA63A FF, PA83 A furin, PA63 sugar 
minus, UP HEXXH, or LP sugar minus (all described in more detail herein) 
and derivatives thereof, e.g., any of the foregoiug polypeptides fused to a TPA 
signal peptide. 

The term "epitopes," as used herein, refers to portions of a polypeptide 
having antigenic or immunogenic activity in an animal, for example a 
mflrmriiil, foT exaoople, a humaa An "immunogenic epitope," as used herein, is 
defined as a portion of aprotein tliat elicits an immune refuse in an animal, 
as detenmned by any method known in the art The tenn '^antigenic qiitope," 
as used herdn, is defined as a portion of a protdn to which an antibody can 
imimmospedficaUy bind its antigen as detennined by any method well known 
inflieaxt Iannun0q>ecific binding excludes non-specific binding but does 
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necessarily exclude cross-reactivity with oflier antigeiis. Antigenic epitopes 
need not necessarily be immunogenic. 

In the present invention, antigenic qpitopes pre&rably contain a 
sequence of at least 4, at least 5, at least 6, at least 7» at least 8, at least 9» at 
least 10, at least 15, at least 20, at least 2S, or between about 15 to about 30 
amino acids contained within the amino acid sequence of a polyp^tide of the 
invention. Certain polypqptides conopising immunogenic or antigenic 
epitopes are at least 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 
85, 90, 95, or 100 amino acid residues in length. Antigmic as well as 
inmnmo^nic qntopes may be linear, Le., be comprised of contiguous amino 
adds in a polypeptide, or may be three dimensional, ie.^ where sn qiitope is 
conqnised of nourcontiguous amino acids which come together due to the 
secondary or tertiary structure of the polypeptide, thereby forming an qpitope. 

The present invention is directed towards polynucleotides oomi»ising 
nucleic add fiagmais of codon-Kiptimized coding regions whidi encode 
polypeptides of Bacillus antkracis, and in particular, Badlbis anthracis 
protective antigen (PA) or lethal Actor (LF), and firagments, variants, or 
derivatives thereof. 

'^Codon optimization" is defined as modifyuig a nucleic add sequence 
for ^dtianced expression in the ceUs of the vertebrate of interest by replacing at 
least one, more than one, or a significant number, of codons of the native 
sequence with codons that are more frequently or most frequently used in the 
genes of that vertebrate. Various species exhibit particular bias for certain 
codons of a particular amino acid. 

The present invention relates to polynucleotides conq)rLsing nucldc 
add fragments of codoxH)ptimized coding re^ons whidi encode Bacillus 
antiiracis polypeptides, with the codon us^ adapted far optimized 
expression in the cells of a given vertebrate. These polynucleotides are 
j^ared by incorporating codons prefisoed for use in die geiies of a given 
spedes into the DNA sequence. Also provided are polynucleotide eq>ression 
constructs, vectors^ host cells compri^ng nucldc add fragments of codon- 
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optmuzed coding regions which encode Bacillus anthracis polypeptides, and 
various methods of using the polynucleotide expression constructs, vectors, 
host cells to treat or prevent anthrax in a vertebrate . 

Codon Optimization 

As used herein the term '^codon optimized coding region" means a 
nucleic acid coding region that has been adspted fi)r expression in the cells of 
a given vertebrate by rqplacing at least one, or more than one, or a significant 
number, of codons with one or more codons that are more firequently used in 
the genes of that vertebrate. 

Deviations in the nucleotide sequence that comprise the codons 
encoding the amino acids of any polypeptide chain allow for variations in the 
sequence codmg for &e gene. Since each codon consists of three nuclrotides, 
and the nucleotides comprising DNA are restricted to four specific bases, there 
are 64 possible combinations of nucleotides, 61 of which encode amino adds 
(the remaining three codons encode signals ending translation). The "genetic 
code" which shows which codons encode vMdk amino acids is reproduced 
herein as Table 1. As a result, niany amino acids are designated by more than 
one codon. For example, the amino adds alanine and proline are coded for by 
four triplets, serine and arginine by six, whereas tryptophan and mefliionine 
are coded by just one tsipleL This degeneracy allows for DNA base 
conq)Osition to vary over a wide range without altering the amino add 
sequence of the proteins encoded by the DNA. 
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TABLB 1: The Standard Genetic Code 





T 


C 


A 


G 


T 


TTTPhe(F) 
TTC" 

ITA Leu (L) 
TTG" 


rCTSer(S) 

rcc" 

rcA 

rcG" 


TATTyrOO 
lAC" 
lAA xer 
TAGTer 


TGTCys(C) 
TGC 

TnA Tfti" 

TGGTcp(W) 


\^ 


CTTLea(L) 
CTC" 

CTG" 


CCTPk>(P) 
CCC" 

CCG" 


CAT His (H) 
CAC" 

PA A ran ^'O^ 
\^jt\i\ villi 

CAG" 


CGTArg(R) 
CGC" 
CGA" 
CGG" 


A 


ATTleO) 

ATC" 

ATA" 

ATGMet(M) 


ACTThrCT) 
ACC" 
ACA" 
ACG" 


AATAs3i(N) 
AAC" 

AAALys(K) 
AAG" 


AGTSer(S) 
AGC" 

AGAAig(R) 
AGG" 


G 


GTTVal(V) 
GTC" 
GTA" 
GTG" 


GCTAla(A) 
GCC" 
GCA" 
GCG" 


GATA^CD) 
GAC" 

GAAGhiCE) 
GAG" 


GGTGly(G) 
GGC" 
GGA" 
GGG" 



5 Many organisms display a bias for use of particular codons to code for 

insertion of a particular amino add ^in a growing peptide chain. Codon 
prefecence or codon bias» di£fercnces in codon usage between organisms^ is 
afforded by degeneracy of the genetic code» and is well documented among 
many organisnis. Codon bias often correlates with the effideocy of translation 

10 of messenger RNA (mSNA), which is in tum believed to be dependent on» 
inter alia^ the properties of the codons being translated and Ifae ayailability of 
particular transfer RNA (tRNA) molecules. The predominance of selected 
tRNAs in a cell is generally a reflection of the codons used most firequently in 
peptide synthesis. Accordingly, genes can be tailared for optimal ^e 

IS expression in a given organism based on codon optimization. 

Given the large number of gene sequences available for a wide variety 
of animal, plant and microbial spedes, it is possible to calculate the relative 
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fi^uencies of codon usage. Codon usage tables are readily available, for 
exaii:q)le, at the "Codon Usage Database" available at 
http://wwwJcazusa.orjp/codQn/ (visited July 9, 2002), and these tables can be 
adapted in a number of ways. See Nakacoura, Y,, et al "Codon usage 

5 tabulated from the intonational DNA sequence databases: status for the year 

2000" NucL Acids Res. 28:292 (2000), Codon usage tables for human, 
mouse, domestic cat, and cow, calculated from GenBank Release 128.0 [15 
February 2002], are rq)ioduced below as Tables 2-S. These tables use mRNA 
nom^lature, and so mstead of Ihymine (T) which is found in DNA^ the 

10 tables use uracil (U)^dL is found in RNA. The tables have been adapted so 

that frequencies are calculated for each amino acid^ rather than for all 64 
codons. 

TABLE 2: Codon Usage Table fr>r Human Genes {Homo sapiens) 



Amino Add 


Codon 


Number 


Frequency 


Phe 


UUU 


326146 


0.4525 


Phe 


UUC 


394680 


0.5475 


Total 




720826 






Leu . 


UUA 


139249 


0.0728 


Leu 


UUG 


242151 


0.1266 


Lea 


CUU 


246206 


0.1287 


Leo 


cue 


374262 


0.1956 


Lea 


CUA 


133980 


0.0700 


Leu 


OJG 


777077 


0.4062 


Total 




1912925 








He 


AUU 


303721 


0.3554 


ne 


AUG 


414483 


0.4850 


ne 


AUA 


136399 


0.1596 


Total 




8S4603 






Met 


AUG 


430946 


1.0000 


Total 




430946 






Val 


GUU 


210423 


0.1773 


Val 


GUC 


28244S 


02380 


Val 


GUA 


134991 


0.1137 
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Amino Acid 


Codon 


Number 


Frequency 


Val 


GUG 


559044 


0.4710 


Total 




1186903 






Ser 


UCU 


282407 


0.1840 


Ser 


UCC 


336349 


0.2191 


Ser 


UCA 


225963 


0.1472 


Ser 


UCG 


86761 


0.0565 


Ser 


AGU 


230047 


0.1499 


Ser 


AGC 


373362 


0.2433 


Total 




1534889 






Pro 


ecu 


333705 


0.2834 


Pro 


CCC 


386462 


0.3281 


Pro 


CCA 


322220 


0.2736 


Pro 


CCG 


135317 


0.1149 


Total 




1177704 






Hir 


ACU 


247913 


0.2419 


Thr 


ACC 


371420 


03624 


Thr 


ACA 


285655 


0.2787 


Thr 


ACG 


120022 


0.1171 


Total 




1025010 






Ala 


Gca 


360146 


0.2637 


Ala 


GCC 


551452 


0.4037 


Ala 


GCA 


308034 


0.2255 


Ala 


GCG 


146233 


0.1071 


Total 




1365865 






Tyr 


UAU 


232240 


0.4347 




UAC 


301978 


0.5653 


Total 




534218 






His 


CAU 


201389 


0.4113 


EGs 


CAC 


288200 


0.5887 


Total 




489589 






Ofai 


CAA 


227742 


0.2541 


Ghi 


CAG 


668391 


0.7459 


Total 




896133 
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Amino Acid 


Codon 


Number 


Frequeacy 


Asn 


AAU 


322271 


0.4614 


Am 


AAC 


376210 


0.5386 


Total 




69S481 






LyB 


AAA 


462660 


0.4212 


Lya 


AAG 


635755 


0.5788 


Total 




1098415 






Aso 


GAU 


430744 


0.4613 


Asp 


GAC 


502940 


0.5387 


Total 




933684 






Gltt 


GAA 


561277 


0.4161 


Gltt 


GAG 


787712 


0.5839 


Total 




1348989 






Cys 


UGU 


190962 


0.4468 


Cys 


UGC 


236400 


0.5532 


Total 




427362 






Trp 


UGG 


248083 


1.0000 


Total 




248083 






Arg 


CGU 


90899 


0.0830 


Aig 


CGC 


210931 


0.1927 


Arg 


CGA 


122555 


0.1120 


Aig 


CGG 


228970 


0.2092 


Aig 


AGA 


221221 


0.2021 


Are 


AGG 


220119 


0.2011 


Total 




1094695 






Gly 


GGU 


209450 


0.1632 


Gly 


GGC 


441320 


0.3438 


Gly 


GGA 


315726 


0.2459 


Gly 


GGG 


317263 


0.2471 


Total 




1283759 






Stop 


UAA 


13963 




Stop 


UAG 


10631 




Stop 


UGA 


24607 
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TABLE 3: Codon Usage Table for Mouse Geies (Mus musadus) 



Amino Acid 


Codon 


Number 


Frequency 


Phe 


UUU 


150467 


0.4321 


Phe 


UUC 


197795 


0.5679 


Total 




348262 






Lea 


UUA 


55635 


0.0625 


Lea 


UUG 


116210 


0.1306 


Leu 


CUU 


114699 


0.1289 


Lea 


cue 


179248 


0.2015 


Lea 


CUA 


69237 


0.0778 


Leu 


CUG 


354743 


03987 


Total 




889772 








Be 


AUU 


137513 


03367 


He 


AUG 


208533 


0.5106 


He 


AUA 


62349 


0.1527 


Total 




408395 






Met 


AUG 


204546 


1.0000 


Total 




204546 






Val 


GUU 


93754 


0.1673 


Val 


GUC 


140762 


0,2513 


Val 


GUA 


64417 


0.1150 


Val 


GUG 


261308 


0.4664 


Total 




560241 






Ser 


UCU 


139576 


0.1936 


Ser 


UCC 


160313 


0.2224 


Ser 


UCA 


100524 


0.1394 


Sct 


UCG 


38632 


0.0536 


Ser 


AGU 


108413 


0.1504 


Ser 


AGC 


173518 


0.2407 


Total 




720976 






Pro 


ecu 


162613 


0.3036 


Pro 


cee. 


164796 


03077 


Pro 


CCA 


151091 


0.2821 


Pro 


ecG 


57032 


0.1065 


Total 




535532 
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Ammo Acid 


Codon 


Number 


Frequency 


Thr 


ACU 


119832 


0^72 


Thr 


ACC 


172415 


0.3556 


Thr 


ACA 


140420 


0J1896 


Thr 


ACG 


52142 


0.1076 


Total 




4S4809 






Ala 


GCU 


178593 


02905 


Ala 


GCC 


236018 


0.3839 


Ala 


GCA 


139697 


02272 


Ala 


GCG 


60444 


0.0983 


Total 




614752 






Tyr 


UAU 


108556 


0.4219 


Tyr 


UAC 


148772 


0.5781 


Total 




257328 






Bis 


CAU 


88786 


0.3973 


His 


CAC 


134705 


0.6027 


Total 




223491 






Gin 


CAA 


101783 


02520 


Gin 


CAG 


302064 


0.7480 


Total 




403847 






Asn 


AAU 


138868 


0.4254 


Am 


AAC 


187541 


0.5746 


Total 




326409 






Lys 


AAA 


188707 


0.3839 


Lya 


AAG 


302799 


0.6161 


Total 




491506 






Asp 


GAU 


189372 


0.4414 


Asp 


GAC 


239670 


0.5586 


Total 




429042 






Glu 


GAA 


235842 


0.4015 


Gla 


GAG 


351582 


0.5985 


Total 




587424 






Cys 


UGU 


97385 


0.4716 


Cys 


UGC 


109130 


0.5284 


Total 




206515 
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Amino Add Codon iNuxnber | Frequency 




Trp 


UGG 


112588 


1.0000 


Total 




112588 






Arg 


CXKJ 


41703 


0.0863 


Aig 


CGC 


86351 


0.1787 


Arg 


CGA 


58928 


0.1220 


Arg 


CGG 


92277 


0.1910 


Arg 


AGA 


101029 


0.2091 


Arg 


AGO 


102859 


02129 


Total 




483147 






Gly 


GGU 


103673 


0.1750 


Gly 


GGC 


198604 


0.3352 


Gly 


GGA 


151497 


0J2557 


Gly 


GGG 


138700 


0.2341 


Total 




592474 






Stop 


UAA 


5499 




Stop 


UAG 


4661 




Stop 


VGA 


10356 





TABLE 4: Codon Usage Table for Domestic Cat Genes (Felis cattus) 



Amino Acid 


Codon 


Numbs: 


Fxequency of usage 


Phe 


UUU 


1204.00 


0.4039 


Phe 


UUC 


1777.00 


0.5961 


Total 




2981 






Leu 


UUA 


404.00 


0.0570 


Leu 


UUG 


857.00 


0.1209 


Leu 


CUU 


791.00 


0.1116 


Leu 


cue 


1513.00 


0.2135 


Leu 


CUA 


488.00 


0.0688 


Leu 


CUG 


3035.00 


0.4282 


Total 




7088 








He 


AUU 


1018.00 


0.2984 


Be 


AUC 


1835.00 


0.5380 


He 


AUA 


558.00 


0.1636 


Total 




3411 
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Amino Add Codon | Number Freqoency of usa^ 




Met 


AUG 


1553.00 


0.0036 


Total 




1553 






Val 


GUU 


696.00 


0.1512 


Val 


GUC 


1279.00 


0.2779 


Val 


GUA 


463.00 


0.1006 


Val 


GUG 


2164.00 


0.4702 


Total 




4602 






Ser 


UCU 


940.00 


0.1875 


Ser 


UCC 


1260.00 


0.2513 


Ser 


UCA 


608.00 


0.1213 


Sa 


UCG 


332.00 


0.0662 


Ser 


AGU 


672.00 


0.1340 


Ser 


AGC 


1202.00 


0.2397 


Total 




5014 






Pro 


ecu 


958.00 


02626 


Pro 


CCC 


1375.00 


0.37© 


Pro 


CCA 


850.00 


0.2330 


Pro 


CCG 


465.00 


0.1275 


Total 




3648 






Thr 


ACU 


822.00 


0.2127 


Thr 


ACC 


1574.00 


0.4072 


Hur 


ACA 


903.00 


0.2336 


Thr 


ACG 


566.00 


0.1464 


Total 




3865 






Ala 


GCU 


1129.00 


0.2496 


Ala 


GCC 


1951.00 


0.4313 


Ala 


GCA 


883.00 


0.1952 


Ala 


GCG 


561.00 


0.1240 


Total 




4524 






Tyr 


UAU 


837.00 


0.3779 


Tyr 


UAC 


1378.00 


0.6221 


Total 




2215 






Tfig 


CAU 


594.00 


0.3738 


His 


CAC 


995.00 


0.6262 


Total 




1589 
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Amino Acid Codon Number | Frequency of usage 




Gin 


CAA 


747.00 


0.2783 


Gb 


CAG 


1937.00 


0.7217 


Total 




2684 






Aaa 


AAU 


1109.00 


0.3949 


Asn 


AAC 


1699.00 


0.6051 


Total 




2808 






Lys 


AAA 


1445.00 


0.4088 




AAG 


2090.00 


0.5912 


Total 




3535 






Asp 


GAU 


1255.00 


0.4055 


Asp 


GAG 


1840.00 


0.5945 


Total 




3095 






Gh 


GAA 


1637.00 


0.4164 


Gk 


GAG 


2294.00 


0.5836 


Total 




3931 






Cys 


UGU 


719.00 


0.4425 


Cys 


UGC 


906.00 


0.5575 


Total 




1625 






Trp 


UGG 


1073.00 


1.0000 


Total 




1073 






Arg 


CGU 


236.00 


0.0700 


Arg 


CGC 


629.00 


0.1865 


Arg 


CGA 


354.00 


0.1050 


Aig 


CGG 


662.00 


0.1963 


AlK 


AGA 


712.00 


0.2112 


Arg 


AGG 


779.00 


0.2310 


Total 




3372 






Gly 


GGU 


648.00 


0.1498 


Gly 


GGC 


1536.00 


0.3551 


Gly 


GGA 


1065.00 


02462 


Gly 


GGG 


1077.00 


0.2490 


Total 




4326 
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Amino Acid 


Codon 


Number 


Fireqaency of usage 


Stop 


UAA 


55 




Stop 


UAG 


36 




Stop 


UGA 


no 





TABLE S: Codon Usage Table for Cow Genes {Bos towns) 



Anmio Add 


Codon 


Number 


Frequent of usage 


Phe 


UUU 


13002 


0.4112 


Fhe 


uuc 


18614 


0.5888 


Total 




31616 






Lea 


XJUA 


4467 


0.0590 


Leu 


UUG 


9024 


0.1192 


Leu 


cuu 


9069 


0.1198 


Leu 


cue 


16003 


0.2114 


Leu 


CUA 


4608 


0.0609 


Leu 


cue 


32536 


0.4298 


Total 




75707 








ne 


AUU 


12474 


0.3313 


He 


AUG 


19800 


0.5258 


He 


AUA 


5381 


0.1429 


Total 




37655 






Met 


AUG 


17770 


1.0000 


Total 




17770 






Val 


GUU 


8212 


0.1635 


Val 


GUC 


12846 


02558 


Val 


GUA 


4932 


0.0982 


Val 


GUG 


24222 


0.4824 


Total 




50212 






Sear 


UCU 


10287 


0.1804 


Ser 


UCC 


13258 


0.2325 


Ser 


UCA 


7678 


0.1347 


Set 


UCG 


3470 


0.0609 


Sa 


AGU 


8040 


0.1410 


Sot 


AGC 


14279 


0.2505 


Total 




57012 
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Ainino Add 


Codon 


Kumbo- 


Frequency of usage 


Pro 


ecu 


11695 


0^684 


Pro 


CCC 


15221 


0.3493 


Pro 


CCA 


11039 


0.2533 


Pro 


CCG 


5621 


0.1290 


Total 




43576 






Thr 


ACU 


9372 


0.2203 


Thr 


ACC 


16574 


0J895 


Thr 


ACA 


10892 


0.2560 


Thr 


ACG 


5712 


0.1342 


Total 




42550 






Ala 


GCU 


13923 


0.2592 


Ala 


GCC 


23073 


0.4295 


Ala 


GCA 


10704 


0.1992 


Ala 


GCG 


6025 


0.1121 


Total 




53725 






Tvr 


UAU 


9441 


0.3882 


Tyr 


UAC 


14882 


0.6118 


Total 




24323 






His 


CAU 


6528 


0.3649 


His 


CAC 


11363 


0.6351 


Total 




17891 






Gin 


CAA 


8060 


0.2430 


Gb 


CAG 


25108 


0.7570 


Total 




33168 






Asn 


AAU 


12491 


0.4088 


Asa 


AAC 


18063 


0.5912 


Total 




30554 






Lys 


AAA 


17244 


0.3897 


Lys 


AAG 


27000 


0.6103 


Total 




44244 






Asp 


GAU 


16615 


0.4239 


Asp 


GAC 


22580 


0.5761 


Total 




39195 
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Amino Add 


Codon 


Number 


Frequency of Tisage 


Glu 


GAA 


211Q2 


0.4007 


Ohi 


GAG 


31555 


0.5993 


Total 




52657 






Cys 


UGU 


7556 


0.4200 


Cys 


UGC 


10436 


0.5800 


Total 




17992 






Tip 


UGG 


10706 


1.0000 


Total 




10706 






Arg 


CGU 


3391 


0.0824 


AiB 


CGC 


7998 


0.1943 


Arg 


CGA 


4558 


0.1108 


Arg 


CGG 


8300 


0.2017 


Arg 


AGA 


8237 


Ow2001 


Arg 


AGG 


8671 


0.2107 


Total 




41155 






Glv 


GGU 


8508 


0.1616 


Gly 


GGC 


18517 


0.3518 


Gly 


GGA 


12838 


0.2439 


Gly 


GGG 


12772 


0.2427 


Total 




52635 






Stop 


UAA 


555 




Stop 


UAG 


394 




.StoE 


UGA 


392 





By utilizing these or siinilaT tables^ one of oidinaiy skill in the art can 
apply tibie fiequendes to any given polypeptide sequence, and produce a 
nndeic acid jEragment of a codon-optiauzed coding r^on which encodes the 
S pdlyp^tide» but wfaidi uses codons optimal &t a given q>ecies. Codour 

optimized coding regions can be designed by various different methods. 

In one method, a codon usage table is used to find the single most 
frequent codon used for any given amino add, and that codon is used each 
time ftat particular anmra add ^ears in the polypeptide sequmce. For 
10 example, referring to Table 2 above, for leucine, the most frequent codon is 
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CUG, which is used 41% of flie time. Thus all the leucine residues in a given 
amino acid sequence would be assigned the codon CUG. Hiunan codon- 
optimized xmcleotide seque[K:es encoding native PA (GenBank Accession 
Number AAA2263 (SEQ ID N0:4)) and native LF (GenBank Accession 
5 Number AAA22569 (SEQ E) NO: 12)) which have been optimized using this 

method are presmted herein as SEQ ID N0:21 and SEQ ID NO:22, 
respectively. 

In another method, the actual fiequencies of the codons are distributed 
randomly througjhout flie coding sequence. Thus using this method for 

10 optimization, if a hypothetical polypeptide sequence had 100 leucine residues, 

referring to Table 2 for frequency of usage in the humans, about 7, or 7% of 
the leucme codons would be UUA» about 13, or 13% of the leucine codcms 
would be UUO, about 13, or 13% of tiie leucine codons would be CUU, about 
20, or 20% of die leucine codons would be CUC, about 7, or 7% of the leucme 

IS codons would be CUA, and about 41^ or 41% of the leucine codons would be 

CUG. These fiequendes would be distributed randomly throughout the 
leucine codons in the coding region encodmg the hypothetical polypq)tide. 
As will be understood by those of ordinary skill in the art, the distribution of 
codons in the sequence will can vary significantiy using tins method, however, 

20 the sequence always racodes flie same polypeptide. Three different human 

codon-optimized nucleotide sequences encoding native FA (GenBank 
Accession Number AAA2263 (SEQ ID N0:4)) which have been optimized 
using fliis method are presented herein as SEQ ID NO:23, SEQ ID NO:24, and 
SEQDDNO:25. Three different human codon-optimized sequences encoding 

25 native LF (GcaiBank Accession Number AAA22569 (SEQ ID NO: 12)) which 

have been optimized using this method are presented herein as S£Q ID N0:21 
and SEQ ID NO:22, respectively. 

When using flie latter method, the term "about" is u^ precisely to 
account for fractional percrata^ of codon frequencies &r a given amino 

30 acid. As used herein, "about" is defined as one amioo add moie or one amino 

acid less than the value given. The whole number value of amino acids is 
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roimded if the fiitctional fiequency of usage is 0.50 or greater, and is 
lounded down if the fractional frequency of use is 0.49 or less. Using again the 
example of the frequency of usage of leucine in human genes for a 
hypothetical polypeptide having 62 leucine residues, the fractional frequency 

5 of codon usage would be calculated by multiplying 62 by the frequCTcies for 

the various codons. Thus, 7.28 percent of 62 equals 4.51 UUA codons, or 
"about S;* Le,, 4, 5, or 6 UUA codons, 12.66 percent of 62 equals 7.85 UUG 
codons or "about 8," ie,, 7, 8, or 9 UUG codons, 12,87 percent of 62 equals 
7,98 CUU codons, or "about 8," 1 c, 7, 8, or 9 CUU codons, 19.56 percent of 

10 62 equals 12-13 CUC codons or "about 12," i.e., 11, 12, or 13 CUC codons, 

7.00 percent of 62 equals 4.34 CUA codons or "about 4." ie., 3, 4, or 5 CUA 
codons, and 40.62 pocent of 62 equals 25.19 CUO codons, or "about 25," le., 
24, 25, or 26 CUG codons. 

Randomly assigning codons at an optimized frequency to encode a 

15 ffvea polypeptide sequence, can be done manually by calculating codon 

frequencies for each amino acid, and then assigning the codons to the 
polypeptide sequrace randomly. Additionally, various algorithms and 
con5)uter software progi^ms are readily available to those of ordinary skill in 
the art. For example, the "EditSeq" function in the Lasergene Package, 

20 available from DNAstar, Inc., Madison, WI, the backtranslation ftmction in 

the VectorNn Suite, available from InfoiMax, Inc., Bethesda, MD, and the 
"backtranslate" function in the GCG-Wisconsin Package, available from 
Acceliys, Inc., San Diego, CA In addition, various resources aire publicly 
available to codoo-optimize coding region sequences. For eacample, the 

25 ^'backtranslation" frmction at 

http-7/www.enteledion.coni/m^backlranslationJit^ (visited July 9, 2002), 
tiie "backtranseq*' frmction available at 

http-y/bioinfo.pbi.src.ca:809Q/B/IBOSS^dexiit^ (visited July 9, 2002). 
Constructing a rudimentary algorithm to assign codons based on a given 

30 frequency can also easily be acconq^lished vdtti basic mathematical fimctions 

by one of ordinary skill 
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A number of options are available fer synthesjziTig codon optmiized 
coding regions designed by any of the mdhods described above, using 
standard and routine molecular biological manipulations well known to those 
of ordinary skill in the art hi one ^proach, a saies of complementary 
oligonucleotide pairs of 80-90 nucleotides each in Imgth and spanning tiie 
length of the desired sequence are synthesized by standard methods. These 
oHgoxuicleotide pairs are synthesized such fbat upon annealing, they fotm 
double stranded fragments of 80-90 base pairs, contaming cohesive ends, e.g., 
each oUgonucleotide in the pair is syndiesized to extend 3, 4, S, 6, 7, 8, 9, 10, 
or more bases beyond the region that is complementary to the other 
oligonucleotide in the pair. The single-stranded ends of each pair of 
oligonucleotides is designed to anneal with the single-stranded end of anotiier 
pair of oli^nucleotides. The oligonucleotide pairs are allowed to anneal, and 
q>proximat6ly five to she of these double-stranded fiagments are then allowed 
to aimeal together via the cohesive single stranded ends, and then they ligated 
together and cloned mto a standard bacterial cloning vector, for example, a 
TOPO® vector available 6om Invitrogen Corporation, Carlsbad, CA. The 
construct is then sequenced by standard methods. Several of these constructs 
consisting of 5 to 6 fragments of 80 to 90 base pair fragments ligated together, 
{.a, fi^gments of about 500 base pairs, are prepared, such that the entire 
desired sequence is represented in a series of plasmid constructs. The inserts 
of these plasmids are then cut with appropriate restriction enzymes and ligated 
together to form the final construct The final construct is then cloned into a 
standard bacterial cloning vector, and sequenced. Additional methods would 
be immediately apparent to the skilled artisan, hi addition, gene synthesis is 
readily available commercially. 

In certain embodiments, an entire polypeptide sequence, or fragment; 
variant, or derivative thereof is codon optimized by any of the methods 
described herein. Various desired fragments, variants or derivatives are 
designed, and each is tiidi codon-<>ptiinized individuaDy. In addition, partially 
codon-optimized coding regions of the preseot invention can be designed and 
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constnicted For example tibie mvention mcludes a nucldc add fiagmmt of a 
codoi>-qptuiiized coding region emu^ding a polypeptide in which at least about 
1%, 2%, 3,% 4%, 5%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 
55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or 100% of the codon 
positions have been codon-optiinized for a given spedes. That is, they contain 
a codon that is preferentially used in the genes of a desired spedes, eig., a 
vertebrate spedes, eg:, humans, in place of a codon timt is nonnaUy used in 
the native nucldc add sequence. 

In additional embodiments, a fiill-lenglh polypqitide sequence is 
codon-optimized for a given spedes resulting in a codon-optimized coding 
region encoding fhe enliie polypeptide^ and then nuddc add fiagments of llie 
codon-optimized coding region, encode fiagmrats, variants, and 

derivatives of the polypeptide are made from fhe original codon-optimized 
coding region. AswouldbeweUundeistoodbythoseof ordinary skill in the 
art, if codons have been rmdomly assigned to the M-length coding region 
based on thdr frequency of use in a given species, nucldc add fragments 
encodmg fragmmts, variants, and derivatives would not necessarily h^fidly 
codon optimized for the giv^ spedes. However, sudi sequences are still 
mudi closer to the codon usage of the desired species than the native codon 
usage. The advantage of this ^jproach is that synthesizing codon-optimis^ 
nucldc add fiagments encoding each fragment, variant, and derivative of a 
given polyp^tide, although routine, would be time consuming and would 
result in significant expense. 

The codon-optimized coding regions can be versions encoding any 
gene products from any strain of Bacillus anthraciSy or fragments, variants, or 
derivatives of such g^e products. Described herdn are nucldc add 
fragments of codon-optimized coding regions encoding the Badttus anthrads 
protective antigen (PA) gene and ih&Badttus anffiracis lethal &ctor (LF), the 
nucldc add fragments encoding the conqplete polypeptide, as well as various 
fragments^ variants, and derivatives thereof, altibougji other PA or LF - 
encoding nucldc add sources are not excluded. 
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The presmt iBvention is directed to coiXQ>ositioxis and methods of 
enhancing the innnune response of a votebrate in need of protection against 
BadUus anthracis infection by admimstering in vivo, into a tissue of a 
vertebrate, a polynucleotide comprisiiog a codon-optimized coding region 

S encoding a polypeptide of Bacillus anthracis^ or a nucleic acid firagment of 

such a coding region encoding a fragment^ variant or derivative thereoC 
Codon optunization is carded out for a particular vertebrate species by 
naelfaods described herein, for example, in certain embodiments codonr 
optimized coding legions encoding polypeptides of Bacitttis anthracis, or 

10 nucleic add fiagments of such codingregions mcoding fiagments, variants, or 

derivatives thereof are optimized according to human codon usa^ The 
polynucleotides of the invention are incorporated into the cdk of the 
vertebrate in vivo, and an immunologically effective amount of a Bacillus 
anthracis polypqptide is produced in vrvo Jh particular, the present invention 

IS relates to codon-optimized coding regions encoding polypqitides of Bacillus 

anthracis^ at nucleic acid fiagments of such coding regions fiagments, 
variants, or derivatives ti^ereof v/bich have been optunized according to 
mammalian codon usage, for example, human codon usage, cow codon usage, 
domestic cat codon usage, or mouse codon usage. For example, human 

20 codon-optimized coding regions encoding polypeptides of Bacillus anthraciSy 

or nucleic acid fragments of such coding regions encoding fragments, variants, 
or derivatives thereof are prepared by incorporating codons preferred for use 
in human genes into the DNA sequence encoding the B. anthracis 
polypeptide. Also provided are polynucleotides, vectors, and other expression 

25 constmcts conqirising codon-optimized coding regions encoding polypeptides 

of Bacillus anihrads, or nucleic add fragments of such coding regions 
encoding fragments, variants, or derivatives thereof, and various methods of 
using such polynucleotides, vectors and other eaqpression constructs. 

The present invention is further directed towards polynucleotides 

30 conq>rising codon-optimized coding regions encoding polypq>tides of 

BadBus anthracis toxin, for exan^le, Bacillus cmihracis lethal toxin and its 
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component polypeptides, for example, lethal fiEU^tor (LF) and protactive 
antigen (PA). The invention is also directed to polynucleotides comprising 
codon-optimized nucleic acid fiagments encoding firagments, variants and 
derivatives of these polypeptides. 

The preset invention provides isolated polynucleotides comprising 
codonroptiniized coding regions of Bacillus anthracis PA, or fiagments^ 
variants, or derivatives thereo£ In certain embodiments described herein, a 
codon-optunized coding legion encoding SBQ ID N0:4 is optimized 
accoiding to codon usage in humans (Homo stqriens). Alternatively, a codon- 
optimized coding region encoding SBQ ID NO:4 may be optinnzed according 
to codon usage in any plant, animal, or microbial species. 

Codon-optimized coding regions cinccding SEQ ID N0:4, optimized 
according to codon usage in himmnB are designed as follows. Hie amino acid 
con^osition of SEQ ID N0:4 is shown hi Table 6. 



TABLB6 



Amino 


Number in 


Acid 


SBQIDNO:4 


A 


Ala 


41 


R 


Arg 


29 


C 


Cys 


0 


G 


Gly 


36 


H 


His 


10 


I 


He 


57 


L 


Leu 


62 


K 


Lys 


60 


M 


Met 


10 


F 


Phe 


24 


P 


Pro 


29 


S 


Ser 


72 


T 


Thr 


58 


W 


Trp 


7 


Y 


Tyr 


28 


V 


Val 


43 


N 


Asa 


69 
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D 


Asp 


47 


Q 


Gin 


31 


B 


Glu 


51 



Using the amino acid composition shown in Table 6, a human codon- 
optimized coding region vMch encodes SEQ ID NO :4 can be designed by any 
of the mefliods discussed herein. In fhe first sppro^ch^ each amino acid is 

5 assigned the most frequent codon nsed in the human genome for that amino 

add. According to ttiis mediod, codons are assigned to the coding region 
encoding SBQ'ID N0:4 as follows: fhe 24 phenjdalamne codons are TTC, fhe 
62 leucine codons are CTG, the 57 isoleucine codons are ATC, die 10 
methionine codons are ATO, the 43 valine codcnis are QTG» die 72 serine 

10 codons are AGC»11ie 29 proline codons are CCX^,die 58 threonine codons are 

ACC, the 41 alanine codons are GCC, the 28 tyrosme codons are TAG, die 10 
histidine codons are CAC, fhe 31 glutamine codons are CAG, die 69 
asparagtne codons are AAC, fhe 60 lysine codons are AAG» the 47 aspaitic 
acid codons are GAG, the 51 glutamic acid codons are GAG, the 7 tryptophan 

15 codons are TGG, the 29 arginine codons are CGG, AGA, or AGG (the 

j&equencies of usage of these three codons in the human genome are not 
significantly different), and die 36 glycine codons are GGC. The codon- 
optimized PA coding region designed by this mediod is presented herein as 
SEQIDN0:21. 

20 Alternatively, a hnman codon-optimized coding region which encode 

SBQ ID N0:4 can be designed by randomly assigning each of any givesi 
amino acid a codon based on the frequency that codon is used in the human 
gmome. These fiequendes are shown in Table 2 above. Using dns latter 
mediod, codons are assigned to the coding region encoding SEQ ID N0:4 as 

25 follows: about 11 of fhe 24 phenylalanine codons are TTT, and about 13 of 

die phoaylalanine codons are TTC ; about 5 of the 62 leucine codons are TTA^ 
about 8 of the leucine codons are TTG, about 8 of fhe leucine codons aro CTT, 
about 12 of fhe leucme codons are CTC, about 4 of the lendne codons are 
CTA, and about 25 of die leucine codons m CIG; about 20 of the 57 
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isoleudne codons are ATT, about 28 of fhe isoleucme codons are ATC. and 
aboiit 9 of the isoleucme codons are ATA; the 10 methionine codons are ATG; 
about 8 of the 43 valine codons are GTT, about 10 of the valine codons are 
GTG, about 5 of the valine codons are GTA, and about 20 of the valine 

5 codons are GTG; about 13 of the 72 serine codons are TCT, about 16 of the 

serine codons are TCC, about 11 of the serine codons are TCA, about 4 of the 
serine codons are TCG, about 11 of the serine codons are AGT, and about 17 
of die Bmns codons are AGC; about 8 of the 29 proline codons are OCT, 
about 10 of the proline codons are CCC, about 8 of the proline codons are 

10 CX^ and about 3 of the proline codons are about 14 of fhe S8 threonine 

codons arc ACT, about 21 of fhe threonine codons are ACC, about 16 of the 
tbreonme codons are ACA, and about 7 of the Ifareoninet codons are ACG; 
about 11 of tibie 41 alanine codons are GGT, about 17 of fhe alanine codons are 
GCC, about 9 of ike alanine codaiis are GCA, and about 4 of the alanine 

IS codons are GCG; about 12 of fhe 28 tyrosine codons are TAT and about 16 of 

fhe tyrosine codons are TAG; about 4 of fhe 10 histidine codons are CAT and 
about 6 of fhe histidine codons are CAC; about 8 of tiie 31 ghjtamine codons 
are CAA and about 23 of the glutamine codons are CAG; about 32 of the 69 
asparagine codoxts are AAT and about 37 of the asparagine codons are AAC; 

20 about 25 of fhe 60 lysine codons are AAA and about 3S of fhe lysine codons 

are AAG; about 22 of the 47 aspartic acid codons are GAT and about 25 of the 
aspartic acid codons are GAC; about 21 of the 51 glutamic acid codons are 
GAA and about 30 of the glutamic acid codons are GAG; the 7 tryptophan 
codons are TGG; about 2 of the 29 a]^^line codons are CGT, about 6 of the 

25 arginine codons are CGC, about 3 of the arginine codons are CGA» about 6 of 

the argjnine codons ace CGG, about 6 of fhe argmme codons are AGA, and 
about 6 of the azgunne codons are AGG; and about 6 of flie 36 glycine codons 
are GGT, about 12 of die gilydne codons are GGC, about 9 of fhe glycine 
codons are GGA, and about 9 of flie gjlycine codons are GGG. 

30 As desaibed above, the torn '"about" means that the number of amino 

adds oicoded by a certain codon may be one more or one less than tilie 
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number given. It wouldbeimderstoodbythoseofordinaiy'skiUinth^ 
the total number of any amino acid in the polypeptide sequence must remain 
constant, therefore, if there is one "more" of one codon encoding a give amino 
acid, there would have to be one "less" of another codon ^coding that same 
amino acid. 

Representative codon-optimized coding regions m(x>ding SEQ ID 
N0:4, optimized according to codon usage in hxnnans designed hy this method 
are presented herein as SEQ ID NO:23, SEQ ID NO:24, and SEQ ID NO:25. 

jji certain aibodiments» the present invention provides an isolated 
polynncleotide comprising a nucleic add fiagment wluch encodes at least 10, 
at least 20, at least 30, at least 40, at least SO, at least 60, at least 70, at least 
80, at least 90, at least 95, or at least 100 or more contigoous amino adds of 
SEQ ID N0:4, ^ere tibie nucldc add fiagment is a fragment of a codon- 
optimized coding region oicoding SBQ ID N0:4. The codon (optimized 
coding region can be optimized according to codon usage in any spedes, for 
example any votcbrate spedes, for example any mammalian spedes, for 
exanptplehumaiL 

Further provided is an isolated polynucleotide comprising a xmcldc 
add fragment of a codon-optimized coding region encoding SEQ ID N0:4, 
where the nucldc add fragmoit encodes amino acids 199 to 764 of SEQ ID 
N0:4. This polypeptide fragment is the 63-kD fimn cleavage product (PA63) 
of the 82-kD protective antigen precursor polypeptide (PA83). The codon 
optimized coding region can be optimized according to codon usage in any 
spedes, for cKBmplo any vertebrate species, for example any mammalian 
spedes, for exanrple human. Included in llns embodiment are nucldc acid 
fragments of a human codon-optimized coding region encoding SEQ ID 
N0:4, selected from: nucleotides 82 to 1779 of SEQ ID N0:1 (shown in Fig. 
1), nucleotides 595 to 2292 of SEQ ID NO:23, nucleotides 595 to 2292 of 
SEQ ID NO:24, and nucleotides 595 to 2292 of SEQ ID NO:25. 

Further provided is an isolated polynucleotide comprising a nucldc 
add fragment of a codon-optimized coding region encoding SEQ ID NO:4, 
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^ere the nucldc acid fiagmrat encodes amino acids 30 to 764 of SEQ ID 
N0:4. This polypeptide firagment is the mature full-length PA, ie., PA83, The 
codon optimized coding region can be optimized according to codon usage in 
any q>6cies, for example any vertdirate species, for example any mammalian 
q>ecies, for exaxxq>le human. Included in this embodiment are nucleic acid 
fragments of a hnman codon-optimized coding region encoding SEQ ID 
NO:4, selected from: nncleotides 88 to 2292 of SEQ ID NO:23, nucleotides 88 
to 2292 of SEQ ID NO:24, and nucleotides 88 to 2292 of SEQ ID NO:25. 

In certain embodimentB, the present imrmtioii provides an isolated 
polynucleotide comprisiiig a imdeic add fi:agme3:it whidi encodes a 
polypeptide at least 60%, 65%, 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 
90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to PA63, 
i.e., amino acids 199 to 764 of SEQ ID N0:4, and where the nucleic acid 
fragment is a variant of a codon-optmiized coding r^on encoding SEQ ID 
N0:4. The codon opthzuzed coding region can be optinpdzed according to 
codon usage in any species, for example any vertebrate species, for example 
any mammalian species, for example faumaiL 

Further provided is an isolated polynucleotide comprismg a nucleic 
acid fragment which encodes a polypeptide variant of PA63, /.e., amino acids 
199 to 764 of SEQ ID N0:4, in which the amino acids corresponding to amino 
acids 342 and 343 of SEQ ID NO:4 have beai deleted, and where the nucldc 
add fragment is a variant of a codon-optimized coding region encoding SEQ 
ID N0:4. This variation in the amino add sequence of PA63 eliminates two 
phenylalanine residues thought to be important in forming the pore in the B. 
oii^Anzcxy leflial toxin. The codon optimized codixig region can be optimized 
according to codon usage in any spedes, for exarr^le any vertebrate cfpedes, 
frnr exanqple any mammalian Gpedes, for exanq>le human. Included in this 
embodiment is anucldc add fragment x^ch is a variant of ahmnan codon- 
optimized coding region encoding SBQ ID N0:4, iK^ere the micldc add 
fragment encodes amino adds 24 to 564 of SEQ ID N0:6 (shown in Fig. 2). 
Also included in this embodiment is a nucleic add fragment conq>risfai& or 
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alternatively consisting of nucleotides 82 to 1773 of SBQ ID NO:5 (shown in 
Fig. 2). 

Fmliiar provided is an isolated polynucleotide comprising a nucleic 
add fragmrait which encodes a polypeptide variant of PA63, Le., mom acids 
199 to 764 of SEQ ID N0:4, in which the aspara^e residues at positions 
conresponding to amino adds 275, 321, 357, 417, 505, 538, 599, 650, 693, and 
738 of SBQ ID NO:4 have been each replaced with an amino adds other than 
asparagine, and vdtiere the nucleic add fiagment is a variant of a codon- 
optimized coding legion encoding SBQ ID N0:4. In certain embodimentSy the 
asparagine residues at positions conresponding to amino adds 275, 321, 357, 
417, 505, 538, 599, 650, 693, and 738 of SEQ ID N0:4 have been eadi 
iqplaced with ghitumitie residues, where the nucldc add fiagmCTt is a variant 
of a codouKqiitimized coding region encoding SBQ ID N0:4. Either of these 
variations in the amino add sequence of PA63 removes adventitous substrates 
for asparagine-linked glycosylation present in the amino add sequence. Hie 
codon optimized coding region can be optimized according to codon usage in 
any species, for exanq>le any vertebrate spedes, for exanxple any mammalian 
species, for example human. Xnchided in this embodiment is a nucleic add 
fragment whidi is a variant of a human codon-optimized coding region 
encoding SEQ ID N0:4, where the nucldc acid fragment encodes amino adds 
24 to 566 of SEQ ID NO: 18 (shown in Fig. 7). Also included in this 
embodiment is a nucldc add fri^ent conqirising, or alternatively consisting 
of nucleotides 82 to 1779 of SEQ ID N0:17 (diown in Fig. 7). 

Further provided is an isolated polynucleotide comprising a nucldc 
add fragmmt which encodes a polypeptide variant of PA63, i\e.> amino adds 
199 to 764 of SEQ ID N0:4, in which tiie amino adds correspondmg to amino 
adds 342 and 343 of SEQ ID N0:4 have been deleted^ where the asparagine 
residues at podtions coiresponding to amino adds 275, 321, 357, 417, 505, 
538, 599, 650, 693, and 738 of SBQ ID N0:4 have been eadi r^laced witii an 
amino adds otiier than asparagine, finr example, glotamine, and wbsK the 
nucldc add fragment is a variant of a codon-pptimized coding region 
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eaooding SEQ ID NO:4. The codxm optimized coding region can be 
optimized aocozding to codon usage in any species, for example any vertebrate 
species, fiur exanq)le any mammalian species, for example hmnan. 

In certain embodiments, the present invention provides an isolate 
polynucleotide comprising a nucleic add frs^ment which encodes a 
polypeptide at least 60%, 65%, 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 
90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to PA83, 
le., amino adds 30 to 764 of SEQ ID N0:4, and where the nucleic arid 
fragment is a variant of a codon-optimized coding region encoding SEQ ID 
N0:4. The codon optimized coding region can be optimized according to 
codon usage in any ^edes, for exanmle any vertebrate species, for exan^yle 
any manimalian q>edes, for exanQ>le hmnan. 

Further provided is an isolated polytmcleotide comprising a nucleic 
add fragment which encodes a polypqytide variant of PASS^ ie., amino adds 
30 to 764 of SEQ ID N0:4, in ^iiich (he amino adds corresponding to amino 
adds 192 to 197 of SEQ ID N0:4 have bem deleted, and where tiie nucldc 
add fragment is a variant of a codon-optimized coding region encoding SEQ 
ID N0:4^ This ^oiation in tiie amino add sequence of PA83 eliminates tiie 
finin cleava^ site in PASS, and tiius the encoded polypq)tide cannot be 
cleaved as a substrate for furin, and cannot form the pore of the lethal toxin of 
B. anHirads. The codon optimized coding region can be optimized according 
to codon usage m any species, for example any vertebrate spedes, for example 
any mammalian species, for example human. Included in this embodimeait is 
a nucldc add fragment whidi is a variant of a human codon-optimized coding 
region encoding SEQ ID N0:4, where the nucldc acid ftagment encodes 
amino adds 24 to 752 of SEQ ID N0:8 (shown in Fig. 3), Also inchided in 
tins embodiment is a nucleic acid JGragment comprising, or ahemativdy 
consisting of nucleotides 82 to 2268 of SEQ ID N0:7 (shown in Fig. 3). 

Further provided is an isolated polynucleotide comprising a nucldc 
add fiagment i^ch encodes a polypeptide variant of PA83, te.^ amino adds 
30 to 764 of SEQ ID N0:4, in which tiie asparajpne residues at positions 
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conespondiBg to amino adds 39, 153, 275, 321, 357, 417, 505, 538, 599, 650, 
693, and 738 of SBQ ID NO:4 have been eadi replaced with an amino acids 
other than asparagine, and where the nucleic acid fragment is a variant of a 
codon-opttmized coding region encoding SBQ ID N0:4. in certain 
embodiments, the asparagine residues at positions corresponding to amino 
adds 39, 153, 275, 321, 357, 417, 505, 538, 599, 650, 693, and 738 of SEQ ID 
NO:4 have hem eadi replaced widi gtetamine residues, ^ere the nucleic acid 
fragment is a variant of a codon-optiroized codiiig region encoding SBQ ID 
NO:4. Either of these variations in the amino acid sequence of PA83 removes 
advendtous substrates for a^aragine-lixiked gIycos}dation present in the amino 
add sequence. The codon optimized coding region can be optimized 
according to codon usagp in any spedes, frir example any vertebrate ^edes, 
for exanq>le any mammalian spedes, for exaniple human. 

Further provided is an isolated polynudeotide conqnising a nucldc 
add fragmmt which encodes a polypeptide variant of PA83, i.e, amino adds 
30 to 764 of SEQ ID N0:4, in ^ch the amino adds corresponding to amino 
acids 192 to 197 of SEQ ID N0:4 have been deleted, where flie asparagine 
residues at positions corresponding to amino acids 39, 153, 275, 321, 357, 
417, 505, 538, 599, 650, 693, and 738 of SEQ ID N0:4 have been each 
replaced with an amino acids other than asparagine, for example, ghitamine, 
and where the nucldc add fragment is a variant of a codon-optimized coding 
region encoding SBQ ID N0:4. The codon optimized coding region can be 
qptibnized according to codon usage in any spedes, for example any vertebrate 
spedes, for example any mammalian spedes, for example human. 

The present invention provides isolated polynucleotides comprising 
oodon-optimized coding regions of Bacillus arUhrads LF, or fragments^ 
variants, or derivatives thereof in certain embodiments desmbed heidn, a 
codon-qptunized coding region aicoding SEQ ID N0:12 is optimized 
according to codon usage in faimaans(JEromoj^i^ Attemativdy, acodoor 
optimized coding region encoding SEQ ID N0:12 may be optimized 
according to codon usag^ in any plant, animal, or microbial species. 
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Codon-optiimzed coding regions eaocoding SEQ ID N0:12, opthnized 
according to codon iisage in hiT m ^R are designed as follows. The amino add 
conspodtion of SEQ ID NO:12 is shown in Table 7. 



TABLE? 



Anuno 


Number in SEQ 


Add 


IDNO:12 


A 


Ala 


34 


R 


Arg 


27 


C 


Cys 


1 


G 


Gly 


35 


H 


His 


21 


I 


ne 


74 


L 


Leu 


80 


K 


Lys 


86 


M 


Met 


10 


F 


Phe 


29 


P 


Pro 


21 


S 


Ser 


54 


T 


Thr 


28 


W 


Tip 


5 


Y 


Tyr 


35 


V 


Val 


40 


N 


Asn 


54 


D 


Asp 


55 


Q 


Gin 


41 


E 


Ghi 


79 



5 

Using fhe amino acid con]|)Osition shown in Table 7, a human codon- 
(^tunized coding region which encodes SEQ ID N0:12 can be designed by 
any of the methods discussed herein. In the fibrst qyproach, each amino acid is 
assigned the most fiequent codon used in tiie human genome for that amino 
10 acid. According to this method, codons are assigned to the coding region 

encoding SEQ ID N0:4 as follows: the 29 phenylalanine codons are TTC, the 
80 leucine codons are CTG, the 74 isoleudne codons are ATC, the 10 
methionine codons are ATG, the 43 valine codons are GTG, the 54 serine 
codons are AGC, the 21 proline codons are CCC, the 28 threonme codons are 
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ACC, the 34 alanine codons are GCC, the 35 tyrosine codons are TAG, the 21 
histidine codons are CAC, the 41 ghitamine codons are CAG» the 54 
asparag^e codons are AAC, the 86 lysine codons are AAG» the 55 aspartic 
acid codons are GAG, the 79 glutamic acid codons are GAG, the 5 tryptophan 
5 codons are TGG, the 27 arginine codons are CGG, AGA, or AGG (the 

frequencies of usage of these three codons in flie human genome are not 
sigoiiicantiy diffcrautX and the 35 glycine codons are GGC* The codon- 
optimized LF coding region designed by this method is presented herein as 
SEQIDNO:22, 

10 Alternatively, a human codon-qptimized coding region which encodes 

SBQ ID N0:12 can he designed by randonily assigning each of any given 
amino acid a codon based on the fiequmcy that codon is used in the human 
genome. These frequencies axe shown in Table 2 above. Usmg this lattier 
me&ody codons ate assigned to flie coding region encoding SEQ ID N0:12 as 

15 foUows: about 13 ofthe 29 phenylalanme codons are TTT and about 16 of the 

phenylalanine codons are TTC; about 6 of the 80 leucme codons are TTA, 
about 10 ofthe leucine codons are TT6, about 10 ofthe leucine codons are 
Cn, about 16 ofthe leucine codons are CTC, about 6 of the leucine codons 
are CTA, and about 32 of the leucine codons are CTG; about 26 of the 74 

20 isoleudne codons are ATT, about 36 ofthe isoleucine codons arc ATC, and 

about 12 of the isoleucine codons are ATA; the 10 methionine codons are 
ATG^ about 7 of the 40 valine codons are GTT, about 9 of the valine codons 
are GTG, about 5 of the valine codons are GTA, and about 19 of the valine 
codons are GTG; about 10 of tiie 54 serine codons are TCT, about 12 of the 

25 serine codons are TCC, about 8 of the serine codons are TCA, about 3 of the 

sedbae codons are TCO, about 8 ofthe serine codons are AGT, and about 13 of 
the serine codons are AQC; about 6 of the 21 proline codons are CCT, about 7 
of the proline codons are CCC, about 6 ofthe proline codons are CCA^ and 
about 2 ofthe prolme codons are CCG; about 7 of the 28 threonine codons are 

30 ACT, about 10 ofthe threonine codons are ACC, about 8 of tiie threonme 

codons are ACA, and about 3 ofthe threonine codons are ACG; about 9 ofthe 
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34 alanine codons are GGT, about 14 of the alanine codons are GCC, about 8 
of the alanine codons are GCA, and about 3 of the alanine codons bio GCG; 
about 15 of the 35 tyrosine codons are TAT and about 20 of tiie tyrosine 
codons are TAG; about 9 of tiie 21 histidine codons are CAT and about 12 of 
5 the histidine codons are CAC; about 10 of tiie 41 glutamine codons are CAA 

and about 31 of the ^utaxnine codons are GAG; about 25 of the 54 a^^aragine 
codons are AAT and about 29 of the asparag^e codons are AAC; about 36 of 
the 86 lysine codons are AAA and about 50 of the lysine codons ate AAG; 
about 25 of flie 55 aspartic acid codons are GAT and about 30 of the BSpsx&c 

10 acid codons are GAG; about 33 of the 79 glutamic acid codons are GAA and 

about 46 of the glutamic acid codons axe GAG; the single cysteine codon is 
eitfas TGT or TGC; the 5 tryptophan codons are TGG; about 2 of the 27 
argmine codons are CGT, about 5 of the arginine codons are CGQ about 3 of 
the argmme codons are CGA, about 6 of the argmine codons are CGG, about 6 

IS of the arguoine codons are AOA, and about 5 of the aiginine codons axe AGO; 

and about 6 of the 35 glycme codons are GGT, about 12 of the glycine codons 
are GGC, about 8 of the glycine codons are GGA» and about 9 of tihie glycine 
codons are GGG. 

As described above, the term "about" means that the number of amino 
20 acids encoded by a certain codon may be one more or one less than the 

number given. Jt would be und^ood by diose of otxlinary skill in the art that 
the total number of any amino acid in the polyp^tide sequence must r^nain 
constant, therefore, if there is one '^more" of one codon encoding a give amino 
acid» there would have to be one "less" of anodier codon encodmg that same 
25 amino acid. 

Representative codon-optimized coding regions encoding SEQ ID 
N0:12, optimized according to codon usa^ in humans designed by this 
metfiod are presmted herem as SEQ ID NO:26» SEQ ID NO:27, and SEQ ID 
NO:28. 

30 In certain embodiments, the present invention provides an isolated 

polynucleotide comgprismg a nucleic add fragment which encodes at least 10, 
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at least 20, at least 30, at least 40, at least SO, at least 60, at least 70, at least 
80, at least 90, at least 9S, or at least 100 or more contiguous amino acids of 
SEQ ID NO:12, where the nucleic acid fiagmrat is a fiagcnent of a codon- 
optimi2;ed coding region encoding SBQ ID N0:12. The codon optimized 
S coding region can be optimized accoiding to codon usage in any species, for 

example any vertebrate species, for example any mammalian species, for 
exanqile human. 

Further provided is an isolated polynucleotide comprisirig a nucleic 
acid fragment of a codonoptimized coding region encoding SEQ ID N0:12> 

10 where the nucleic add fragment encodes amino acids 34 to 809 of SEQ ID 

N0:12. This polypeptide fragment is the mature form of B. anthracis LF. The 
codon optimized coding re^on can be optimized according to codon usage in 
any species, ibr example any vertebrate species, for exantple any mammalian 
species, for example human. Included in this embodiment are nucleic add 

IS fragments of a human codon-optimized coding region oicdding amino adds 

34 to 809 of SEQ ID N0:12, selected from: nudeotides 100 to 2427 of SEQ 
ID NO:26, nucleotides 100 to 2427 of SEQ ID NO:27, and nucleotides 100 to 
2427ofSEQIDNO:28. 

Further provided is an isolated polynucleotide oonocprising a nucldc 

20 add fragment of a codon-optimized coding region ^coding SEQ ID NO; 12, 

where the nucleic add fragment encodes amino adds 34 to 583 of SEQ ID 
NO: 12. This polypeptide fragment encodes domains I-IH of mature B, 
anthrads LF, but not domain IV, the protease domain. The codon optimized 
codmg region can be optimized according to codon usage in any species, £)r« 

25 example any vibrate ^des, for example any mammalian spedes, for 

exanqple human. Included in this embodiment are nuddc add fragments of a 
human codon-optimized coding r^on encoding SBQ ID N0:12, selected 
from: nucleotides 82 to 1731 of SBQ ID N0:13 (ahown in Fig. 5), nucleotides 
100 to 1752 of SEQ ID NO:26, nucleotides 100 to 1752 of SEQ ID NO:27, 

30 and nudeotides 100 to 1752 of SEQ ID NO:28. 
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Further provided is an isolated polynucleotide coniprising a nucleic 
add fragment of a codon-optimized coding region encoding SEQ ID N0:12, 
where the nucleic acid fiagment encodes amino adds 34 to 254 of SEQ ID 
N0:12. This polypq)tide fragm^t encodes a portion of domain I of mature B. 
S /mdrociL; LF, that directly binds to PA63. The codon optimized coding region 

can be optimized according to codon usage in any species, for example any 
vibrate species, for eican^le any mammalian spedes, for exanq>le human. 
Included in this embodim^ are nucldc add fiagments of a human codon- 
optimized coding region encodnig SEQ ID N0:12, selected firom: nucleotides 

10 82 to 744 of SEQ ID N0:1S (shown in Fig. 6), nucleotides 100 to 762 of SSQ 

ID NO:26, nucleotides 100 to 762 of SEQ ID NO:27, and nucleotides 100 to 
762ofSEQIDNO:28. 

In certain embodiments, the presait invention provides an isolated 
polynudeotide con^pridng a nucldc add fiagmmt which encodes a 

15 polypqptide at least 60%, 65%, 70%, 75%, 80%, 85%, 86%, 87%, 88%, 89%, 

90%, 91%, 92%, 93%. 94%. 95%. 96%, 97%, 98%, or 99% identical to LP, 
amino acids 34 to 809 of SEQ ID N0:12, and wheie the nucldc acid 
fragment is a variant of a codon-optimized coding region encoding SBQ ID 
NO: 12. The codon optimized coding region can be optimized according to 

20 codon usage in any spedea, for example any vertebrate species, for example 

any mammalian spedes, for example human. 

Further provided is an isolated polynucleotide comprising a nucleic 
add fragment whidi wcodes a polypq)tide variant of LF, {.e., amino adds 34 
to 809 of SEQ ID NO: 12, in which the histidine residues corresponding to 

25 amino adds 719 and 723 of SEQ ID N0:12 have been deleted, and replaced 

wilh an amino add oth^ than histidine, and/or the gfaitamic acid residue 
corresponding to amino add 720 of SEQ ID NO:12 has been deleted and 
replaced with an amino add other than glutafiiic add, vAiac^ the nuddc add 
fragment is a variant of a codonroptimized coding region encoding SEQ ID 

30 N0:12. M certain embodimmts^ tile histidine reddues corresponding to amu^ 

adds 719 and 723 of SEQ ID N0:12 have beeu ddeted, and replaced with 
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alanine residues, and/or the ghrtanuc acid residue coiresponding to amino acid 
720 of SEQ ID NO:12 has been deleted and replaced with an aspartic acid 
residue, where the nucleic add fiagment is a variant of a codon-optimized 
coding region eacodiug SEQ ID N0:12. Any of these variations in the amino 
add sequence of LF, dther alone or in combination, eliminate the protease 
activity of LF* The codon optimized coding region can be opthnized 
accordiiig to codon usage in any spedes^ for example any vertebrate sped^ 
fixr exasoEple any mammalian species, for exanq>le humait Included in tibds 
embodiment is a nucldc add fiagment which is a variant of a human oodon- 
<^timized coding region encoding SEQ ID NO:l2, where the nucldc acid 
fiagm^ encodes amino adds 24 to 799 of SEQ ID NO:10 (shown in Fig. 4). 
Also included in fiiis embodiment is a nucldc add fisgment comprising, or 
alternatively consisting of nucleotides 82 to 2409 of SEQ ID K0:9 (shown in 
Fig. 4). 

Further provided is an isolated polynucleotide conq)rising a nucleic 
acid fragment whidi encodes a polypeptide variant of LF, amino adds 34 
to 809 of SEQ ID NO: 12, in which the asparagine residues at positions 
corresponding to amino adds 62, 212, 286, 478, 712, 736, and 757 of SEQ ID 
NO: 12 have been eadi replaced with an amino acids other than asparagjoie, 
and where the nucldc acid fragment is a variant of a codon-optimized coding 
region encoding SEQ ID NO: 12. In certain embodiments, the asparagine 
residues at positions corresponding to amino adds 62, 212, 286, 478, 712, 
736, and 757 of SEQ ID NO: 12 have been each replaced witii ghitamine 
residues, where tiie nucldc acid fiagment is a variant of a codon-optimized 
coding region ^coding SEQ ID N0:12. Ei&er of these variations in the 
ammo add sequence of LF remove adventitous substrates for asparagine- 
linkedglycosylation preset in tiie amino add sequ^^ The codon optimized 
coding region can be optimized according to codon usage in any spedes» for 
exanQ>le any vertebrate spedes, for example any mammalian spedes, for 
exanq)le human. 
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Further im>vided is an isolated polynucleotide conqprisiiig a nucleic 
acid fragment wiricli mcodes a polypeptide variant of LF, ie., amino acids 34 
to 809 of SBQ ID NO:12, in which the histidmc residues corresponding to 
amino adds 719 and 723 of SEQ ID NO: 12 have been deleted, and replaced 

5 wilh an amino acid other than histidine^ and/or the glutamic add residue 

GQiresponding to amino add 720 of SBQ ID N0:12 has been deleted and 
replaced with an amino add other than ghitamic acid, and the asparagine 
residues at positions corr^pondiiig to amino acids 62, 212, 286, 478, 712, 
736, and 757 of SEQ ID N0:12 have been each replaced witix m amino adds 

10 other than asparagine^ and where the nucldc add fragment is a vaciant of a 

codon-optimized coding region mcoding SEQ ID NO: 12. In cotain 
embodiments, the histidine residues conesponding to amino adds 719 and 723 
of SEQ ID N0:12 have been deleted, and replaced with alanine residues, 
and/or the ghitamic add residue corresponding to amino add 720 of SEQ ID 

IS N0:12 has been deleted and replaced with an aspartic add residue, and the 

asparagine residues at positions corresponding to amino adds 62, 212, 286, 
478, 712, 736, and 757 of SEQ ID N0:12 have bem each r^laced wifli 
glutamine residues, where the nucleic acid fragment is a variant of a codon- 
optimized coding region encoding SEQ ID NO: 12, Any of these variations in 

20 the amino acid sequence of LP, either alone or in combination, eliminate the 

protease activity of LF, and also, adventitous substrates for asparagine-tinked 
griycosylation present in the amino add sequence have been removed. The 
codon optimized coding region can be optimized according to codon usage in 
any species, for exanqple any vertebrate spedes, for example any mammalian 

25 spedes, for 6xaiq>le human. Included in this embodiment is a nucleic add 

fragment which is a variant of a human codon-optimized coding region 
encoding SEQ ID NO: 12, vfbme the micldc acid fragment encodes amino 
adds 24 to 799 of SEQ ID NO:20 (shown in Fig. 8). Also inchided in ^s 
embodiment is a nucldc add fragment coiiQ)rishig, or alternatively consisthig 

30 of nucleotides 82 to 2409 of SEQ ID N0:19 (shown m Fig. 8). 
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In tbis manner^ the presmt inveaition provides a method of enh&nciiig 
the level of polypeptide expression fix>m delivered polynucleotides tn vivo 
and/or facilitating tq>take of the polynucleotide by the cells of a desired 
spwaes, &T example a vertebrate 8peci<», for example a mammalian species, 
for example humans. Accordingly, the present invention provides a method of 
treatment and prevention agBinst Bacillus mthracis infection. 

Methods and Administration 

The present inv«ition further provides methods for deUvering a 
polypq)tide into a vertebrate which comprise administering to a vertebrate 
one or more of the compositions described herem; such that \xpon 
administration of compositions such as those described herein, a B, anthracis 
polypeptide is expressed in the vertebrate, in an amount sufficient generate an 
immune response to B, anthracis, 

Ihe tenn "vertebrate" is intended to encompass a singular 'Vertebrate** 
as well as plural 'Vertdsrates," and conq)rises mammals and birds, as well as 
fish^ reptiles, and amphibians. 

The temi "tnatrtrnftl " is intended to encompass a singular "mammal'' 
and plural "manunals," and includes, but is not limited to humans; primates 
such as apeSi monkeys, orangutans, and chinqianzees; caiiids such as dogs and 
wolves; felids such as cats, lions, and tigers; equids such as horses, donkeys, 
and zebras, food animals such as cows, pigs, and sheep; ungalates such as deer 
and ^raffes; and uisids such as bears. In particular, the mammal can be a 
human subject, a food animal or a companion animal. 

The present invention further provides a method for gmerating, 
enhancing or modulating an immune response to B. anthracis conqmsing 
administering to a vertebrate one or more of tiie compositions describe 
herein. In tbis method, the composition includes an isolated polynucleotide 
comprising a human codon-optimized coding region encoding a polypeptide 
of Bacillus anthracis^ or a nucleic acid fragment of such a coding region 
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encoding a fiugment, variant, or derivative thereof. The polynucleotides are 
incorporated into the cells of the vatebrate in vivOy and an antigenic amount of 
ttie Badllus anthracis polypeptide, or fragmenl^ variant, or derivative thereof^ 
is produced in vivo. Upon administration of &6 composition accordiiiig to this 
method, the BaciUus anthracis polypeptide is expressed in the vertebmte in an 
amount sufficient to elicit an immune r^ponse. Such m immune response 
might be used, for example, to ^erate antibodies to B, anthracis for use in 
diagnostic assays or as laboratoiy reagents* 

The present invention further provides a method £ar generating^ 
enhancing, or modnlating a protective and/or therq>eulic immune re^Kmse to 
A anthracis in a vertebrate, comprising administering to a vertebrate in need 
of ihCTpeutic and/or preventative immonity one or more of Ifae con:q>ositions 
described herein. In tins mdfaod, the con^osition includes an isolated 
polynucleotide con^sing a human codon-optimized coding region weeding 
a polypq)tide of Bacillus anthracis^ or a nucleic add fragment of such a 
coding region encoding a fragmmt, variant, or derivative thereof The 
polynucleotides are incoiporated into the cells of the vertebrate in vhv, and an 
immunologically effective amount of the Bacillus anthracis polypeptide, or 
fiagment or variant is produced in vivo. Upon administration of ttie 
composition accoixUng to this method, the Bacillus anthracis polypeptide is 
expressed in the vertebrate in a therapeutically or prophylactically effective 
amount 

As used herein, an "immune response" refers to the ability of a 
vertd}rate to elicit an immune reaction to a composition delivered to that 
vertebrate. Bxamples of imnrane responses include an antibody reqxmse or a 
celhilar, e.g., T-cell, response. One or more coQq>08itions of ^ present 
nxvention may be used to treat a vertebrate prophylacticaUy, e,g., as a 
I^ophylactic i^odne, to establish or enhance immunity to B. anthracis in a 
healthy vertebrate prior to exposure to A anthracis or contraction of antiirax 
disease, thus preventing flie disease or reducing the severity of disease 
symptoms. One or more compositions of the present invention may also be 
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used to treat a vertebrate already exposed to B. anthrads^ or already suffering 
from anthrax disease to finAer stiimilate tbe immune system of the vertebrate, 
dius reducing or elnniriating tiie symptoms associated with that disease or 
disorder. As defined herein, ''treatment of a vertebrate" refers to the use of 
One or more conq>ositions of the present invmtion to prevsit, cure, retard, or 
reduce the severity of anthrax disease syoxptoms in a vertebrate, and/or result 
in no worsening of anthrax disease over a specified period of time. It is not 
required tiiat any conopositiQn of flie present invention provide total immunity 

to B. anthracis or totally cure or eliminate all anthrax disease symptoms. As 

I 

used herein, a "a vertebrate in need of therapeutic and/or preventative 
immunity^ refers to a vertebrate ^ch it is desirable to treaty to prevent, 
cure, retard, or reduce the severity of anthrax disease symptoms, and/or result 
in no worsening of anthrax disease over a specified period of tune. 

In other embodiments, one or more compositioiis of the present 
invention are utilized m a "^rime boost" regimen. In these embodiments, one 
or more polynucleotide vaccine compositions of the present invention are 
delivered to a vertebrate^ thereby priming the immune response of the 
vertebrate to B. anthracis^ and tiien a second irrununogenic composition is 
utilized as a boost vaccination. One or more polynucleotide vaccine 
conq)Ositions of the present invention are used to prune immunity, and then a 
second iimmmogenic composition, eg., a recombinant viral vaccine or 
vaccines, a different polynucleotide vaccine, one or more purified snbunit 
Bacillm anthracis protems, eg,, PA or LF or a variant, fi:agment, or 
derivative thereoi^ or the existing AVA anthrax vaccine, is used to boost the 
eas&'BaciUus anthracis immune response. The polynucleotide vaccine 
cornpositions may coxrqirise one or more vectors for expression of one or more 
Bacillus arUhrads lethal toxin gmes as described herein. In addition, 
polynucleotide prime vaccine and the lat^ boost vacdne elicit an immune 
re^Kmseto the same or sunilar anti^ns» or tiiey maybe to different antigens. 



wo 2004/024067 



PCTAJS2003/028199 



-57- 

In another embodiment^ vectois are prepared for expression in the 
recombinant vxros vaccine and in transfected mamxDafian cells as part of a 
polynucleotide vaccine. 

The tenns "piiming*' or "primary" and "boost" or "boosting" are used 
herein to lefo to the initial and subsequent immunizations, r^ectively, Le., 
in accordance with the definitions these temis normally have in immunology. 

Sterile immunity is defined herein as tiie ability to completely inhibit 
the gemoination of anthrax qxires into bacteria. If germination occurs, the 
bacteria produce Letx and surviving rabbits immnnissed against flie PA antigen 
would be expected to generate a response to LF. Ukewise^iabbits immunized 
witibi LF should have a measurable resfponse to PA. 

Antibodies mduced by recombinant PA or by the commercial anthrax 
vaccine, AVA, have been shown to have potoitial activities ottxer than 
neutralization, that may affect the outcome of an infection by antbrax. Among 
these potential activities is the effect of preventing gennination of bacteria 
fiom the spores. (Welkos, S. et at Microbiology. 147: 1677-85 (2001)). 
DNA vaccination may induce levels of antibody consistent with those that 
prevent germination. The absence of an increase in LF, PA, or neutralization 
titere, following infection, has been observed in animals vaccinated with DNA 
vaccina. This is in contrast to animals vaccinated twice with a commercial 
anthrax vaccine, AVA. While not being bound by tiieory, the DNA vaccine 
may induce antibodies that possess novel protective activities independent of 
lethal toxin neutralization. 

In certain embodiments, one or more compositions of the present 
invention are delivered to a vertebrate by methods described herem, fliereby 
achieving an effective immune response^ and or an effective therapeutic or 
preventative immune response* 

More specifically, the con^ositions of ttie present invention may be 
admimstered to any tissue of a verteixrate, including, but not limited to, 
muscle, skin, brain tissue, hmg tissue, liver tissue, splen tissue, bone marrow 
tissue, thynnis tissue, heart tissue, ^g., myocardium, endocardium, and 
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pericardium, lymph tissue, blood tissue^ bone tissue^ pancreas tissue, Iddney 
tissue, gall bladder tissue, stomach tissue, intestinal tissue, t^cular tissue, 
ovarian tissue, uterine tissue, vagjnal tissue, rectal tissue, nervous system 
tissue, eye tissue, glandular tissue, tongue tissue, and conneotive tissue^ e.g,, 
cartilage. 

Fiirthennore, the compositions of Ihe present iavoition may be 
administered to any internal cavity of a vertebrate, including, but not limited 
to, flxe lungs, the moudi, the nasal cavity, the stomach, the peritoneal cavity, 
the intestine, any heart chamber, vems, arteries, ci^illaries, lymphatic cavities, 
the uterine cavity, the vaginal cavity, the rectal cavity, jomt cavities, ventricles 
in brain, spinal canal in spmal cord, the ocular cavities, the lumen of a duct of 
a salivary gland or a livor. When the conq)ositioiis of the present inventiim 
administered to the lumai of a duct of a salivary gland or a liver, the desired 
polypeptide is CTcodedin each of the salivary gland and the liver such that tiie 
polypeptide is delivered into the blood stream of the vertebrate from each of 
the salivary gland and the liver. Certain modes for administration to secretory 
organs of a gastrointestinal system using the salivary giland, liver and pancreas 
to release a desired polypeptide into the bloodstream is disclosed in U.S. 
Patent Nos. 5,837,693 and 6,004,944, both of which are incorporated herein 
by reference in their entireties. 

In one embodiment, the con^ositions are administered to muscle, 
either skeletal muscle or cardiac muscle, or lung tissue* Specific, but non- 
limiting modes for administration to lung tissue are disclosed in Wheeler, C J., 
et oL, Proc NatL Acad. ScL USA P3:11454-11459 (1996), which is 
incorporated herein by reference in its entirety. 

According to the disclosed methods, compositions of the present 
invention can be adminisbsred by intramuscular (i jxl), subcutaneous (sx.), or 
intnpilmonary routes. Other suitable routes of administration include, but not 
limited to intratracheal, transdermal, hitraocalar, intranasal, inhalation, 
mtracavity, intravenous (i.v.), intraductal (e.g;, into the pancreas) and 
intrqiarmchymal (ie., into any tissue) administtation. Transdermal defiveiy 
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includes, but not limited to intradennal (eg., into the dennis or epidennis), 
transdemial percutaneous) and transmucosal administration (z.e, into or 
tiixough skin or mucosal tissue). Intraca:vity adnsinistration includes, but not 
limited to adminitration into oral, vaginal, rectal, nasal, peritoneal, or intestinal 
5 cavities as well as, intrathecal (Le., into spinjal canal), intraventricular (i.e., 

into the brain ventricles or the heart ventricles), inraatrial (Le, into the heart 
atrium) and sub arachnoid (i.e., into the sub arachnoid spaces of the brain) 
administration* 

Any mode of administration can be used so long as the mode results 'in 

10 the expression of the desired peptide or protein, in the desired tissue, in an 

amount sufSdent to genende an immune resfponse to B. anOirads mAlot to 
generate a piophylactically or therapeutically effective immune response to B, 
anthmcis in a vertebrate in need of such response. Administration means of 
the present inv^ition include needle injection, catheter infusion^ biolistic 

IS injectors, particle accelerators (e.g., "gene guns'" or pneumatic ""needleless" 

injectors) Med-E-Jet (VaWsing, H., et aL, J. IhmnmoL Methods 171,11-22 
(1994)), Pigjet (Schrijver, IL, et al. Vaccine 15, 1908-1916 (1997)), Biojector 
(Davis, H., et oL, Vaccine 12, 1503-1509 (1994); Granizindd, R., et al„ Mol 
Med. 4, 109-118 (1998)), AdvanlaJet (Linmayer, L, et aL, Diabetes Care 

20 P:294-297 (1986)), Medi-jector ^artins, L, and Roedl, E, X Occup. Med 

27:821-824 (1979)), gelfoam sponge dq)ots, other commercially available 
depot materials (ag., hydrogels), osmotic pumps (ag., Alza minipumps), oral 
or suppositorial solid (tablet or pill) phamiaccutical fbnnulations, topical skin 
creams, and decanting, use of polynucleotide coated suture (Qin, Y., et al., 

25 Life Sciences 65, 2193-2203 (1999)) or topical 2?)plications during surgery* 

Certain modes of administration are intramuscular needle-based injection and 
pufanonary q)plication via cattieter infiision. Each of the references cited in 
this paragrqih is incorporated herein by ref^ence in its entirety. 

Detemmnng an effective amount of one or more compositions of the 

30 present invention depends iqx>n a number of &ctors including for exanq>le» 

the antigen being expressed, eg:, PA or LF or fiagments, variants, or 
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derivatives thereof, the age and weigjit of the subject, the precise condition 
requiring treatment and its severity, and the route of administration. Based on 
the above factors, determining the precise amount, number of dos^ and 
tjmtng of doses are withia the ordinary skill in the art and will be readily 
detemained by the attending pbysidan or veterinarian. 

C<»xq>osi1ions of the pr^ent invention may include various salts, 
exGipients* delivery vehicles and/or auxilhary agents as are disclosed, e.g„ in 
U.S. Patent Application Publication 2002/0019358, published February 14, 
2002, which is incorporated herein by reference in its ootirety. 

Furttiennore» compositions of the present invention may include one or 
more transfection ftcifitating compomids that fiidlitate delivery of 
polynucleotides to the intodor of a ceU, and/or to a desired location within a 
celL As used herein, the tmns ^transfection ftciliating compound,*' 
"transfecticm fedlitating agent," and "transfection fiualiating material*' are 
synonymous, and may be used interchangeably. It diould be noted that colain 
transfixtion facilitating conqpounds may also be "acjyuvants" as described 
infra, le.y m addition to fadlitatmg delivery of polynucleotides to the tntmor 
of a cell, the compound acts to alter or increase the immune response to the 
antigen encoded by that polynucleotide. Examples of the transfection 
facilitating compounds include, but are not hmited to inorganic materials such 
as calcium phosphate, alum (aluminum sulfiate), and gold particles (a^., 
"powder" type delivery vehicles); peptides tiiat are, for exan^ile, cationic, 
intercell" targeting (for selective delivery to certain cell types), iatracell 
targeting (for nucleor localization or oidosomal esc^e), and ampipathic (helix 
forming or pore forming); proteins that are, for example, basic (eg., positively 
char^^ such as histones, targeting (e.^., asialoprotein), viral (e.g., Sefidai 
virus coat protem), and por&-formins lipids that ace, for example, cationic 
(eg., DMRIE, DOSPA» DC-Chol), basic {e.g., steryl andne), neutral (eg., 
cholesterol), anionic (eg., phosphatidyl serine), and zwitteriordc (eg., DOPE, 
DOPC); and polymers such as dendrimers, star-polymers, "homogenous" 
poly-anuno acids (eg., poly^lysine, poly-arginine), ''hetecogenous" 
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poly-amino adds (eg., mixtures of lysine & glycine), co-polymeis, 
polyvinylpyrrolidinonfi (PVP). and polyetiijdene glycol (PEG), A transfection 
^ciHtating material can be used alone or in combination with one or more 
other transfection fecilitating materials. Two or more transfection &cilitating 

S matsials can be combined by chemical bonding (e.g,^ covalent and ionic such 

as in Hpidated polylysine^ PEGyiated polylysine) (Toncheva, et aL^ Biochinu 
Biophys. Acta i5«0(3>:354-368 (1988)), mechical mixing (e.g., free moving 
matedals in liqidd or solid phase such as ''polylysine + cationic Iq^ids") (Gao 
and Huang, Biochemistry 55:1027-1036 (1996); Tnibet8la>y, ei aL, Biachem. 

10 Biopfys. Acta ii5i:311-313 (1992)), and aggregation (&g., co^xiecipitation, 
gel fixnning such as in cationic lipids -l-poly-lactide co-galactide, and 
polylysine + gelatin). 

One category of transfection fitdlitating materials is cationic lipids. 
Exanq>les of cationic lipids are S-caiboxyspennylglycine dioctadecylamide 

15 (DOGS) and dq)a]mito34-phqpliatidyleihanola]nine-ScaA 

(DPFBS). Cationic cholestml dmvatives are also useful, including {3P-[N- 
N' ,N'-dimethylamino)ethane]-carbomoyl}-cholesteiol (DC-Chol). 
Dimethyldioctdecjd-ammonium bromide (DDAB), N-(3-aminDprop>d)-N,N- 
(6i?-(2-tetradecyloxyethyl)>N-mefliyl-anmionium bromide (PA-DEMO), N-(3- 

20 annnopn>p)d)-N,N-(6is-(2Kiodecyloxyeihjd))rN-methyl-ammom bromide 

(PA-DELO), N,N,N-/rts-^-dodecyloxy)ethyl-N-(3-airdno)pit>pyl-ani^ 
bnmdde (PA-TELO), and N^'<3-aminopropj4)((2-dodecyloxy)e£hyl)rN^-<2- 
dodecyloxy)ethyl-l-pipera2dnaminiinn biomide (GA-LOB-BP) can also be 
en^loyed in the present invention. 

25 Non-diether cationic lipids, such as DL-l,2Hlioleoyl-3- 

dim6diylainino{nop;^-P-hydn>xye1faylanmionium(DOIUdie l'0<!LeyL'2' 
oleoyl-S^dhnethylaminopn^ytp-hydioxyeihjd^^ (DORI ester/ether), 
and thehr salts promote in vtvo gene delivery. In some embodiments, cationic 
lipids comprise groups attached via a hetetoatom attached to the quatematy 

30 ammonium mdefy in tiie head gcoiq;). A glyajd spacer can connect the linker 

to die hydrox)4 group. 
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Spedfic, hut non-Iimitiiig cadonic lipids for use in certain 
embodinients of the present invootioa include DMRIB ((±)-N- 
(2-hydn>xyethj1)-NJhI-dimethyl-2^-H 

bionride), GAP-DMORE ((±>N<3-aminopiopyr)-N^-^ethyl-2,3-bis(5)7?- 
5 9-tdi:adeceneyloj^>-l-propananiinium fatomideX and GAP-DLRIB ((±)-N-(3- 

aininopropyI)-N^^iiniefliyl-2^<fr2$-^^ 
bromide). 

Olfaer cationic lipids include (:l:>NJ4-dimethyl-N-[2- 
(sperminecaiboxamido)eth:^]-23-bis(diole^oxy)-l-pn)pani 

10 pentahydiodiloride (DOSPA), (±)-N-(2-aminoefliyl)-N^-diniefhyl-23- 

bis(tGtradecyloxy)-l-iiiopaniniinium bromide (p-aminoetbyl-DMRIB or PAE- 
DMKE) (Wheeler, et aL, BiocMnu Biophys. Acta 1280:1-11 (1996)), and (±)- 
N-(3-aminopn3pyl)-N,NHiimethyl-2,3-bis(dodecyloxy)-l-propa 
bromide (GAP-DLRIE) (Wheeler, et cd,, Proc, Nail Acad Sci. USA 

15 93:11454-11459 (1996)), wWdi have been developed fix)mDMRffi. 

Other examples of DMRIE-derived cationic lipids that are useM for 
the present invention are (±)-N-(3-aminopropyl)-N,N-dimethyl-2y3-(fris- 
decyloxy)-l-propanaminium bromide (GAP-DDRIE), (±)-N-(3-aminopropyl)- 
N,N-dimethyi-2,3-(to"tetradecyloxy)"l-piopanam bromide (GAP- 

20 DMRIB), (±)-N-((N"-methyl)-N*-ureyl)piopyl-NJ^-dimethyl-2,3- 

bis(tetradecyloxy)-l-propanaminiiim bromide (GMU-DMRIE), (±)-N-(2- 
hydro2qre1fayl)-NJ^'Kiimefeyi-2,3-bis(dodecyloxy)-l-prqpm bromide 
pLRBE), and (±>N-{2-hydroxycthyI>N,N-dime%l-2,3-to-^^ 
octadecenyioxy)propyM- piopanrminiiTm bromide (HP-IX)RIE). 

25 In the embodiments wh^re the immunogenic composition conQ)ri5es a 

cationic lq)id, file cationic lipid may be mixed wifli one or more co-lipids. For 
pmposes of definition, the tenn "co-lqnd" refers to any bydxophobic matedal 
which may be combined with the cationic l^d component and includes 
ano^patfaic fipids, such as phospholipids, and neutral l^ids, such as 

30 diolestcroL Cationic l^iids and co-lqiids may be mixed or combined in a 

number of ways to prodnce a variety of non-covaloidy bonded macroscopic 
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stmctures, iBctuding, for exaxuple, Kposomes, multilainellar vesicles, 
unilamellar vesicles, micelles^ and siiDple fihns. One non-Imiiting class of co- 
lipids are the zwittedonic phosphol^ids, which include the 
phosphatidydefhanolaxDmes and tiie phosphatidylcholmes. Examples of 

5 phosphfltidylefliaiiDlamines, include DOPE^ DMPB and DPyPB. In certain 

embodiments, the co-l^id is DP>i*E, which conqprises two phytanoyl 
sabstitueots incatpoiated into the diacyi^hosphatidylethanolamine skeleton* 

When a con:q>osition of the present invention con^nses a cationic lipid 
andco-lipid, the cationic l^id:co-lipid molar ndo may be fiom about 9:1 to 

10 about 1:9, fix>m about 4:1 to abotxt 1:4 , fiom about 2:1 to about 1:2, or about 

1:1. 

In Older to maximize homogenity, the cationic fipid and co-lipid 
conqMmeaatB may be dissolved in a solvent such as chloroform, followed by 
evaporation of tihe cationic lipid/co-lq>id solution under vacuum to dryness as 

IS a film on the inner surfiice of a glass vessel (e.g.» a Rotovap round-bottomed 

flask). Upon suspension in an aqueous solvent, the amphipathic lipid 
component molecules setf-assemble kto homogenous lipid vesicles. These 
lipid vesicles may subsequently be processed to have a selected mean diameter 
of unifonn size prior to contplexing with, for example, a codon-optimized 

20 polynucleotide of the present invention, according to methods known to those 

skilled m Ae art For example, the sonication of a lipid solution is described 
in Feigner et al., Proc. Natl. Acad. Sd. USA 84,7413-7417 (1987) and in U.S. 
Pat No. 5,264,618, the disclosures of which are incorporated, herdn by 
reference. 

25 In those embodiments where the conqposition includes a cationic Upid, 

polynucleotides of the present invention are conplexed with Iqnds by mixing, 
for example, a plasmid in aqueous solution and a solution of cationic l^d:co- 
lipid as prepared herem are mixed. The concentration of each of the 
constLtuent sohxtions can be ad^'usted prior to mixing such fliat the desired final 

30 plasmid/cationic lq)id:co-%id ratio and &e dedred plasmid final 

concratration will be obtained upon mixing the two solutions. The cationic 
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lipidico-lqnd mixtures are suitably prq)ared by hydratiiig a thin film of the 
mixed lipid materials in an app ro priate volume of aqueous solvent by vortex 
mixing at ambient tanperatures for about 1 minute. The thin films are 
prq)8red by aHmiviTig chlorofomoi solutions of the individual con^Kments to 
afford a desired molar solute ratio followed by aliquoting the desired volume 
of the solutions into a suitable container. The solvent is removed by 
evaporation, first witti a stream of dry, inert gas (e.g. aigon) followed by fai^ 
vacuum treatment. 

Other hydrophobic and amphiphilic additives, such as, for cxamplo, 
sterols, fatty acids, gangliosides, glycolipids, lipopeptides, %osacchandes, 
neobees, niosomes, prostaglandins and spbingolipids, may also be included in 
conQ>08ition8 of flie present invention. In such compositions, these additives 
may be included in an amount between about 0 . 1 mol % and about 99.9 mol % 
(relative to total lipid), about 1-SO mol %, or about 2-2S mol %. 

Additional embodiments of the present invention are drawn to 
compositions comprising an amdliacy agent. The present invention is fiirther 
drawn to methods to use such compositions, methods to make such 
compositions, and pharmaceutical kits. As used herein, an ^'auxiliary agent" is 
a substance included in a composition for its ability to enhance, relative to a 
composition ^Mch is identical except for the inclusion of the auxUiaiy agent, 
the entry of polynucleotides into vertebrate cells tn Wvo, and/or the in vivo 
explosion of polypeptides encoded by such polynucleotides. Aindliary agents 
of the present invention include nonionic, anionic, cationic, or zwitterionic 
sur&ctants or detergents, in particular, nomonic surfitctants or detergents, 
chelators, DNase inhibitors, agmts that aggregate or condense nucleic acids, 
emulsifying or solubilizing agents, wetting agoits, gel-fi>nmng ageots, and 
bufifers» 

Amdliaxy ag^ts for use in compositions of tiie present invention 
inchide, but are not limited to non-ionic detergents and sur&ctants IQEPAL 
CA 630® CA 630, NONIDBT NP-40, Nonidct ® P40, Twecn-20®, Twem- 
80®, Ptaronic® F68, Phnonic F77®, Plunmic P65®, Triton X-100™*, and 
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Triton X-1 14™; the anionic detergent sodium dodecyl sulfete (SDS); the sugar 
stachyose; the condensing agent DMSO; and the chelatoi/DNAse inhibitor 
EDTA« In catain specific ^bodiments, the auxiliary agent is DMSO, 
Nomd^ P40, Phnonic F68®» Pluronic F77® » Fluronic Phiiomc L64®, 
5 and Plvroxdc F108®. See, e.g., U.S. Patent Application Piiblicaticm 

20Q200193S8, published Febmary 14, 2002, windi is incorporated herom by 
reference in its entirety. 

Compositions of the present invention can be formulated according to 
Tsnown methods. Suitable preparation methods are described, for e&mple, in 

10 Remington's Pharmaceutical Sciences, Edition, A. Osol, ed, Mack 

Publishing Co., Easton, PA (1980), and Remington's Pharmaccaitical Scimces^ 

19* Edition, A.R, Gennaio, ed, Mack Publishing Co., Easton, PA (1995), 
both of which are incorporated herein by reference in then entiieties. 
Although the composition may be administered as an aqueous solution, it can 

IS also be formulated as an emulsion, gel, solution, suspension, lyophilized form, 

or any other form known in the art. In addition, the composition may contain 
pharmaceutically acceptable additives including, for example, diluents, 
binders, stabilizers, and preservatives. 

Certain omq^ositions of the present invration may further include one 

20 or more known adjuvants. The term "adjuvant" refers to any material having 

the ability to (1) alter or increase the immune response to a particular antigen 
or (2) increase or aid an effect of a pharmacological agent It should be noted, 
with respect to polynucleotide vaccines, that an ''adjuvant," may be a 
transfection fodlitating material. Similariy, certain "transfection fiicilitadng 

25 materials" described stq^ra, may also be an "adjuvant" An adjuvant may be 

used with a composition comprising a polynucleotide of the present invention. 
In a prime-boost regiment, as described herein, an adjuvant may be used with 
either the prinnng immunization, the booster immunization, or both. Suitable 
adjuvants include, but are not limited to, cytokines and growth factors; 

30 bactmal components (e.g., endotoxins, in particular superantigens, exotoxins 
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and cell wall coirq>onents); aluminum-based salts; caldum-based salts; silica; 
polynucleotides; toxoids; serum proteins, viruses and viialljMlerived 
materials^ poisons, venoms, and cationic lipids. 

The ability of an adjuvant to increase the immune response to an 
aniigw is typically manifested by a significant increase in immune-mediated 
protection. For example, an increase in humoral immunity is typically 
manifested by a significant increase in the titer of antibodies raised to the 
antig^ and an increase in T-cell activity is typically manifested in increased 
cell proliferation, or cdhilBr cytotoxicity. An ai^uvant may also alter an 
ixmnune response^ £br example, by changing a primarily humoral or 
response into a primarily cellular, or Thi re^KXtm 

In certain adjuvant conqiostions, the adjuvants are c^kines. A 
cQDOtposition of llie present invention can comprise one or more cytokines, 
chemoldnes, or conq>ounds that induce the production of cytokines and 
chemokfaies, or a polynucleotide encoding one or more cytokines, chemcddnes, 
or conqiounds that mdiuce the production of cytokiiies and ch^okines. 
Examples include, but are not limited to granulocyte maciophage colony 
stimulating fector (GM-CSF), granulocyte colony stimulating fector (G-CSF), 
macrophage colony stimulating &ctor (M-CSF), colony stimulating factor 
(CSF), erythropoietin (EPO), interleukin 2 (IL-2), interieukin-3 (IL-3), 
interleukin 4 (IL-4), interleukin 5 (IL-S), interleukin 6 (D>6), interl^ildn 7 
QLrT), interleukin 8 (IL-8), interleukin 10 (EL-IO), mterleukin 12 (11^12), 
int^ukin 15 (IL-IS), interleukin 18 (IL-18), interferon Ophsi (IFNa), 
interferon beta (EFNp), interfoon gamma (IFNy), interferon omega (IFNo), 
interferon tan (IFNt), interfenm gammaindudng &ctdrl (IGIF), transfimning 
growtti fictor beta (TGF-p), RANTES (regulated \xpon activation^ normal T- 
cell expressed and presumably sec^reted), mm^hage inflammatory protdns 
(e.g:, MIP-1 a^ha and MIP-1 beta), LeisJmanIa elongation initiating fhctor 
(LEIF), and Flt-3 ligand 

hi certain compositions of the preseot invention, the polynucleotide 
construct may be complexed witii an adjuvant composition comprising (:i:>-N- 
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(3-ammopropyl)-N^-dimethyl-23^^^ 

biomide (GAP-DMORIE). The composition may also 
comprise one or more co-lipids, e.g., l^-dioleoyl-^-glycero-3- 
phosphoethanolamine (DOPE), l^-diphytanojd-^-glycero-S- 

5 phoqphoethanolamine (DPyPE), and/or l,2-dimyristoyl-glycer-3- 

phosphoetfaanolamind PMPB). An adjuvant composition comprising ;GAP- 
DMOSIB and DFj^B at a 1:1 molar ratio is referred to hodn as Vaxfectin™ 
See, e.g, , PCT Publication No. WO 00/57917, which is incorporated herein by 
reference in its entirety. 

10 Nucleic add molecules and/or polynucleotides of the present 

invention, pDNA, mRNA, linear DNA or oU^ucleotides, may be 
solubilized in any of various bufTers. Suitable bufEisrs include, for amsple^ 
phosphate bufTeied saline (FBS), normal saline, Tris buffer, and sodium 
phosphate (&.g., ISO mM sodium phosphate), insoluble polynucleotides may 

IS be solubilized in a weak add or weak base, and then diluted to tiie desired 

volume with a buffer. The pH of the buffer may be adjusted as s^ropriate. 
In addition, a pharmaceutically acceptable additive can be used to provide an 
appropriate osmolarity. Such additives are witibin the purview of one skilled 
in the art. For aqueous compositians used in vrvo, sterile pyrogen-free water 

20 can be used Such formulations will contain an effective amount of a 

polynucleotide together with a suitable amount of an aqueous solution in ordCT 
to prepare pharmaceutically acc^table compositions suitable for 
administration to a vertebrate. 

EXAMPLES 

25 Materials and Methods 



The following materials and methods apply generally to all die 
examples disclosed herdn* Specific matmala and m^ods are disclosed in 
each exanq)le, as necessary. 
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The practice of Ihe presmt invatiQiL will employ, unless oflierwise 
indicated, conventional techniques of cell biology, cell culture molecular 
biology (including PGR), vaccinology, microbiology, recombinant DNA» and 
izomunology, which are within the skill of the art. Such techniques are 
S explained iuUy in the literature. See^ for e?cample. Molecular Qoning A 

Laboratory Mamial, 2nd Ed., Sambrook et aL, ed.. Cold Spring Harbor 
Laboratory Press: (1989); DNA Cloning, Volume I and n (D. N. Glover ed., 
1985); Oligonucleotide Synthesis (M. 1. Gait ed, 1984); MuUis et al U.S. PaL 
No: 4,683,195; Nucleic Acid Hybridization (B. D. Hames & S. L Higgins eds. 

10 1984); Transcription And Translatian (B. D. Ham.es & S. J. IBggms eds. 

1984); Culture Of Animal Cells (R. L Freshney, Alan IL Liss, lac, 1987); 
bnmohUized Ceils And Enzymes (JKL Press, 1986); B. Pecbal, A Practical 
Guide To Molecular Cloning (1984); the treatise. Methods In Bnzymology 
(Academic Press, Inc., N.Y.); Gene Transfer Vectors For Mammalian Cells (J. 

IS R MiUer and M. P. Calos eds., 1987» Cold Spring Harbor Laboratory); 

Methods Jh Enzymotogy, Vols. 154 and 155 (Wu et al. eds.X Immunochemical 
Methods In Cell And Molecular Biology (Mayer and Walker» eds.. Academic 
Press, London, 1987); and in Ausobel et al. Current Protocols in Molecular 
Biology, John Wiley and Sons, Baltimore, Marjdand (1989). 

20 

Plasmid Vector 

Constructs of the present invention were inserted mto eukaryotic 
ecqrression vector V1012. This vector is built on a modiiBied pUC18 
bacl^mid {see Yanisch-Perron, C, et al. Gene 35:103-119 (1985)), and 

25 contains a kanamydn resistance gene, the human cytomegalovhns immediate 

early 1 promoter/enhaocer and intron A, and Ihe bovine growth hormone 
transcription termination signal, and a polylinker for inserting foreign genes. 
See Hartikka, L, et al.. Hum, Gene Ther. 7:1205-1217 (1996). However, other 
standard commercially available eukaryotic e3q)ression vectors may be used m 

30 the present invention, including, but not lunited to: plasmids pcDNA3, 
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pCMV/Zeo, pOG.l, pEFl/His, pIND/GS, pRc/CMV2, pSV40/Zeo2, 
pTRACER-CMV, pUB6A^5-His, pVAXl, and pZeoSV2 (available finom 
Ihvitrogen, San Diego, CA)» and plasmid pCI (available fiom Fiomega, 
Madison, WQ. 

5 

Plasmid DNA purification 

Plasmid DNA was transfoimed into Escherichia coli DH5a competent 
cells and highly purified coval^tly closed circular plasmid DNA was isolated 
by a modified lysis procedure (Horn, NA., et al. Hum, Gene Ther. d:565-573 

10 (1995)) followed by standard double CsCl-ethidium bromide gradient 

ultracentrifiigation (Sambrook, J., et aL, Molecular Cloning: A Laboratory 
ManmU 2nd Bd, Cold Spring Harbor Laboratory Press, Plainview, New Yoik 
(1989)). Alternatively, plasmid DNAs are purified using Giga columns from 
Qiagen (Valencia, CA) according to the kit instructions. All plasmid 

IS preparations were free of detectable chromosomal DNA, KNA and protein 

inq)urities based on gel analysis and the bicinchomnic protein assay (Pierce 
Chem. Co., Rockford IL). iBiidotoxin levels were measured using Umulus 
Amebocyte Lysate assay (LAL, Associates of C^e Ood, Fafanouth, MA) and 
were less than 0.6 Endotoxin Units/mg of plasmid DNA The 

20 spectrophotometric A260/A2g0 ratios of the DNA solutions wa:e typically 

above 1.8. Plasmids were elhanol precipitated and resuspended in an 
appropriate solution, e.g., ISO mM sodium phosphate (for other qypropriate 
excq}ients and amdliaiy agents, see Patent Application Publication 
20020019358, pubKshed February 14, 2002). DNA was stored at -20^C until 

25 use. DNA was diluted by mixing it witti 300 mM salt solutions and by adding 

^ipropriate amount of USP water to obtain 1 mg/ml plasmid DNA in the 
desired salt at the desired molar concentratioa 
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Ihjections of plasmid DNA 

The quadricqjs muscles of xestrained awake mice (e.g., female 6-12 
week old BALB/c mice fiom Harlan Spiague Dawley, Itidiaiu^lis» IN) aie 
injected bilateiaUy with SO \xg of DNA in SO |il solution (100 pig 19 100 |il 

S total per mouse) using a disposable sterile, plastic insulin syting? and 28G 1/2 

needle (Becton-Dickinson, Franklin Lakes, NT, Cat No. 329430) fitted with a 
plastic collar cut from a micropipette tip, all as previously described (Hartildca, 
J., et d.. Hum. Gene Ther. 7:1205-1217 (1996)). 

Animal care throughout the study was in compliance with the ^Ouide 

10 for the Use and Care of Laboratory Animals", Institute of Laboratory Animal 

ResoiHces, Commission on Life Sciences, National Research Council, 
National Academy Press, Washington, D.C., 1996 as well as with Vical's 
Institational Animal Care and Use Committee, 

IS Immune Correlates 

Since anthrax challenge experiments must be carried under strict 
containment conditions, tibiey can be difficult and expulsive, even in laboratory 
animals. Accordin^y, it has been very important for workers in this area to 
develop in vitro assays to measure levels of immunity and to d^onstrate that 

20 tiiese assays sufficiently correlate to in vivo challenges. A number of in vitro 

assays, which are known to those of ordinary skill in the art to be correlates for 
challenges have been have been developed See, e.g.^ Reuveny, S. et a/. Infect. 
Immun. 5P:2888-2893 (2001); KobUer, D. et al. bifecL Jmmun. 70:S44-S60 
(2002); Pitt, MX. et al Vaccine iP;4768^773 (2001); and Park. S.. and 

25 Lq)pla, SJL Protem Ejqxr. Purif i&293-302 (2000), each of which is 

incorporated herein by re&rence in its entirety. An additional assay is 
described in Bxample 9(b), te^a. 
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EXAMPLEl 

Construction of an Isolated Polynucleotide Conqmsing a Human Codoi^ 
Optimized PA Coding Region, l^icoding the Full Length BadUus Anthrads 

Protective Antigen (PA) 

5 A representative native Bacillus anthrads protective antigen (PA) 

nucleotide sequence consists of nucleotides 1804 to 4098 of GenBank 
accession number M22589 version M22589.1 GI:143280 (SEQ ID N0:3). 
See WeIkos,SX. et al. Gene 5P:287-300 (1988), \^ch is incorporated herein 
by referecLce in its entirety. The PA sequence encode a 764 amino add (aa) 

10 precursor protein (SEQ ID N0:4) that is processed by a signal peptidase upon 
season by the bacteria, and also by host serum iHK)tease8 (reviewed in 
Mesnage S., and Fouet, A. 1 BaderioL 184:331-334 (2002), which is 
incorporated by reference herein in its mtirety). Hie first 29 amino acids of 
PA encodes a bacterial signal sequence that is cleaved during secretion from 

15 llie bacteria. In the host, furin-like serum proteases cleave off the N-temnnal 
258 amino acids to yield PA63, the active form of PA that can bind lethal 
£Etctor (LF) and edema &ctor (EF), thereby causing toxicity. 

A nucleic acid coding region for full-length PA (SEQ ID N0:4), 
optimized for human codon usage was derived by detemining codon 

20 frequencies fiom tiie human codon usage table (Table 2) as described above. 

The codon-optimized nucleic acid sequence was created by using the various 
codons encoding the amino acids of SEQ ID N0:4, each at the firequeaicies 
with vdndi they occur in the codon usage table of Table 2. Although any 
codon-optmuzed coding region which encodes SEQ ID N0:4 may be used, 

25 including, but not limited to SEQ ID Nos 23, 24, or 25, this Bxan^le and other 

Examples below use the human codon-optixnized coding region encoding SEQ 
ID NO:4 represented by SEQ JD NO:23. Alternatively a human codon- 
optimized nucleic acid coding region encoding SEQ ID N0:4 can be prq)ared 
by referring to the codon usage table of Table 2» and using only the most 

30 fiequent codons for each amino acid, as represented by SEQ ID N0:21. 
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The micleic add rq>resented by SEQ ID NO:23 is constructed in the 
folloiraig manner. First, a saies complementary oligonucleotide pairs of 80- 
90 nucleotides each in length and spanning the lengQi of SEQ ID NO:23 are 
synthesized by standard methods. These oligonucleotide pairs are synthesized 

S such that upon annealing, they form double stranded fragments of 80-90 base 

paii8» containing cohesiye oids. The aingl^stranded ends of each pair of 
oligomicleotides are designed to anneal with a single-stranded end of an 
a^acent oligonucleotide diqilex. Several adjacent oligonucleotide pairs 
pr^ared in Hiis naanner are allowed to annea], and appraximately five to six 

10 adjacent oligonucleotide diqilex fiagmenls are then allowed to anneal together 

via the cohesive single stranded ends. This series of annealed oligonucleotide 
diQ>lex fragments is then ligaled together and cloned into the TOPO® vector 
available fiom Inyitiogen Cknporation, Carlsbad^ CA. The construct is then 
sequenced by standard methods. Constructs prepared in this manner, 

15 comprising S to 6 adjacmt 80 to 90 base pair fiagments ligated tog^her, i.e, 

fiagments of about 500 base pairs, dre prepared, such that the entire deshed 
sequence of SEQ ID NO:23 is r^resented in a series of plasxnid constructs. 
The inserts of th^e plasmids are then cut with appropriate restriction enzymes 
and ligated together to form the final construct The final constnict is then 

20 cloned into a standard bacterial cloning vector, and sequmced. 

EXAMPLE2 

Construction of an Isolated Polynucleotide Comprising a Human Codon- 
Optimized LF Coding Region, Encoding the Full Length Jffactf/us Anthrads 

Lethal Factor (LF) 

25 A representative native Bacillus anthrads lethal factor (LF) nucleotide 

sequence consists of nucleotides 685 to 3111 of GenBank accession nlmiber 
M30210 version M30210.1 GL143141 (SEQ ID N0:11). The LF sequence 
encodes a 809 amino acid precursor protein that is processed to a 775 amino 
add secreted protein by cleavage of its signal seqimce. LF is a zinc 
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metalloprotease that cleaves mitoi^-activated pxotem kinase kuiases 
(MAPKKs) contained inside target cells. See Mesnage S., and Fouet, A. J. 
Bacteriol 754:331-334 (2002). Nmnerous mutations in LF have been 
described that eliminate 2dno binding or the catalytic site of LF resulting in the 

5 loss of toxicity. See Hammond, S.E., ami Hanna, P.C. Infect. Lnmun, 

66:237^237% (1998). One form of inactive LF is described in detail herein, 
but all others could also be used with an identical apptoachu 

A nucleic acid coding region for full-length LF (S£Q ID N0:12), 
optimized for human codon usage was derived by detenrnning codon 

10 fiequendes &om the human codon usage table (Table 2) as described above. 

The codon-optimized nucleic acid sequmce was created by using the various 
codons encoding the amino adds of SEQ ID N0:12, each at the fiequendes 
with whidi Ihey occur in the codon usage table of Table 2. Although any 
codon-optimized cqdiog region which mcodes SEQ ID N0:12 may be used, 

IS induding, but not limited to SEQ ID NOs 26, 27, and 28, this Example and 

other Bxaniples below use the human codon-optimized coding regiati 
encoding SEQ ID N0:12 represeaited by SEQ ID NO:26. Altematively a 
human codon-optimized nucleic add coding region encoding SEQ ID NO: 12 
can be prepared by referring to the codon usage table of Table 2, and using 

20 only the most fi:equent codons for each ammo add, as represented by SEQ ID 

NO:22. 

The nucldc acid rq>resented by SEQ ID NO:26 is constracted 
commerdally by R^iogen, San Diego, CA, in the following manner. First, a 
series conxplementary oligonucleotide pairs of 80-90 nucleotides each in 

25 length and spanning the length of SEQ ID NO:26 are synthesized by standard 

methods. These oUgonncleotide pairs axe synthesized sudi that upon 
amiealing, tiiey fbmi double stranded fiagqaents of 80-90 base pairs, 
containing cohesive ends. The single^tranded ends of each pah of 
oligonucleotides are designed to anneal wifli a singlo-stranded end of an 

30 84iacent oli^nuicleotide dvplex. Several adjaceoit oligonucleotide pahs 

prepared in tins manner are allowed to anneal, and approximately five to six 
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adjacent oligonucleotide duplex fi:agtiients are tbtea allowed to anneal toge&er 
via ttie cohesive single stranded ends. This seri^ of annealed oligonucleotide 
duplex fiagments are ibm ligated together and cloned into a the TOPO® 
vector available from InvitrogOT Corporation, Carlsbad, CA. The construct is 
then sequenced by standard methods. Construpts prepared in this maimer, 
cQnq>rising 5 to 6 acgacent 80 to 90 base pair fragments ligated together, 1&» 
fragmoits of about 500 base pairs, are prepared, such that the entire desired 
sequence of $EQ ID NO:26 is represented in a series of plasmid constructs. 
The inserts of these plasmids are titien cut with appropri ate restriction enzymes 
and ligated together to form the final construct. The final oonstnict is dien 
cloned into a standard bacterial cloning vector, and sequenced. 

EXAMPLES 

Constmction of Plasmid Constructs Congnising Fragments, Variants, and 
Derivatives of a Hunoian Codon-Optimized Coding Region Encoding Badllus 

Anthracis PA 

Several fi^;ments, variants^ and derivatives based on SEQ ID NO:23, 
the human codon-optimized coding region encoding Bacillus anthracis PA 
desmbed in Exanqple 1, were constructed in tiie following manner. Codon- 
optimized nucleic acid fiagments encoding three altonate forms of PA were 
constructed, namely, a nucleic add firagment encoding fuU-Ioigth PA minus 
the furin cleavage site (PA83A Furin), a nucleic add fiagment encoding the 
active furin cleavage product of mature PA (PA63), and a nucldc add 
firagment encoding the active furin cleavage product of mature PA in which 
Phe 342 and 343 have been deleted (PA63API0- Each of tiiese nucleic acid 
fragments were fused in-fi:ame to a nucldc acid encoding a human ti^ue 
plasminogeai activator (TPA) signal peptide sequence that directs the 
ejcpressed PA variants and/or fragments to the secretory pathway in 
mammalian cells. Other usefiil PA fragments, variants and/or derivatives will 
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be rcadily parent to those of ordinary skill in the art, and are included in tiie 
present invention. 

a) Construction of TPA-PA63. 

5 PA63, the C-tenninal fiagment of FA conresponding to amino adds 

199-764 of SEQ ID N0:4 corresponds to the mature, extracellularly processed 
protein that is able to bind to LF and edema fector (EF). 

TPA-PA63 (Fig. 1, SEQ ID N0:1) was constructed commCTciaUy by 
Retcogen, San Diego, CA A large number of other companies which provide 

10 similar construction of predeteonined nucleic acid sequences are well known 

to those of ordinary skill in the art The sequmce was constructed in the 
following mann^. First, a series conq>lementary oUgonucleotide pairs of 80- 
90 nucleotides each in length and spanning the length of SEQ ID N0:1 were 
syndiesized by standard m^ods. These oligonucleotide pairs were 

IS synthesized such that upon annealing, they foimed double stianded fiagmetits 

of 80-90 base pairs, containing cohesive ads. The single-stranded ends of 
eadi pair of oligonucleotides woe designed to anneal with a single-stranded 
end of an adjacent oligonucleotide diq;>l6x. Several aciyacent oligonudeotide 
pairs prq>ared in this maamea: were aUowed to anneal, and iqpproximately five 

20 to six a^'acent oligonucleotide duplex fragmeaits w^ fiien allowed to anneal 

togeth^ via the cohesive single stranded ends. This series of annealed 
oligonucleotide duplex fragments were then ligated toother and cloned into a 
the TOPO® vector available fiom invitrogen Corporation, Carlsbad, CA. The 
construct was then sequenced by standard methods. Constructs prepared in 

25 this manner, conqnising 5 to 6 adjacent 80 to 90 base pair fr^ments ligated 

togeflier, fragments of about 500 base pairs, were prepared, such that the 
entire desired sequence of SEQ ID N0:1 was represented in a series of 
plasmid constructs. The xos^ of these plasmids were then ciit with 
^ropriate restriction eaazymes and ligated together in the TOPO® vector. 
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This constiuct was cut with EcoRV + BaniHI and the 1788 bp insert 
firagment (ie., SEQ ID N0:1) was cloned into the same sites of the VR1012 
expression plasmid (see Hartikka et oL, Hum. Gene Therapy 7:1205-1217 
(1996), which is incorporated herein by reference in its entirety). The 
5 resulting plasmid, designated VR6290, was sequenced and es}>re8sed in 

transiently transfected VM-92 cells in culture (see Bxample 6) to confinn the 
esqaession and secretion of the construct 

b) Construction of TPA-PA63AFF. 

10 A different non-toxic form of PA can be generated by deleting the two 

phenylalanine r^du^ at positions 342 and 343 of SEQ ID N0:4 to produce a 
PA protein that cannot hq>tamerize and form a pore to allow LF and EF to 
caotCT the cytoplasm of an infected cell. See^ e.g.^ Singh» Y. et al. J. Biol 
Chem. 2tfP:29039-29046 (1994), which is incorporated herein by reference in 

IS its entirety. 

An expression plasmid comprising IPA-PA63A FF (Fig. 2, SEQ ID 
NO:S) was prepared by the following method Plasmid VR6290, prepared as 
desmbed in section (a), stpra, was used as a template for PGR with the 
following two sets of PGR primets using Turbo Pfu polymerase fium 

20 Stratagene Ihc«, La Jolla, CA 

1. TPAHfor 5*GAGCTTOATATCGCCACCATGOATGC3* (SE^ 
NO:29) and PA del FF-Rev 5' CCACCAATAT CCGATGCATG 
GACTTCCGC 3' (SEQ ID NO:30) produced a 520 bp fragment 



Z HPA-endRev 5'CTTGAAGGATCCTCAACCGATCTCGTAG3' 
(SEQ ID N0:31) and PA del FF-For 5' CCATGCATCG 
QATATTGGTO GCTCCGTOTC 3' (SEQ ID NO:32) produced a 
1280 bp fragment that overlapped fragment 1. 

30 

Fragments 1 and 2 were gel purified using the QIAquick Gel 
Extraction Kit from Qiagen Lie (Valencia, CA) and the fragmaits were 
combined in a subsequmt PGR reaction and amplified with the primer pair 



wo 2004/024067 



PCT/US2003/028199 



-77- 

TPA-for and HPA-endRev to yield the fall laigfli 1782 bp j&agment shown in 
Fig 2, This fragment was digested with the restriction eatizymes EcoR5 + 
BamHI and ligated into the same sites of the VR1012 expression plasmid 
The resulting plasmid, designated VR6291» was sequenced and expressed in 
S transiently transfected VM-92 cells in culture (see Exanq)le 6) to confimi the 

expr^ion and secretion of the construct 

c) Construction of TPA-P-A^A Furin. 

Deleting the furin cleavage site of the mature PA, Le.^ amino acids 

10 192-197 (Ser-'Arg-Lys-Lys-Arg-Ser) of SEQ ID N0:4, yields a protein that is 

secreted from the cell and that can bind the host cell receptor but cannot bind 
LF or EF and thiCTefbre is non-toxic. See, e.g., Singh, Y, et al Infect. ImmutL 
tftf:3447-3448 (1998), and Klimpel KR, oi Proc Natl Acad. ScL USA 
^P:10277-1Q281 (1992), which are incorporated herein by reference in their 

IS entireties. 

An oiixression plasmid comprising TPA-PA83Afurin (Fig. 3, SEQ ID 
N0:7) was cot^tructed in the following manna:. A plasmid conqnising a 
codcn-optimized nucleotide sequCTce (as per the sequeoce of SEQ ID NO:23) 
encoding ttie N-tenmnal 20 kD domain of PA, ie., corresponding to the 

20 portian of PA that is cleaved off by furin, was synthesized by R^gen Inc. 

according to the meihod described in section a), ^ra. This plasmid was cut 
with EcoRV+Afel and tbe 570 bp insert was gel purified as above. The 
plasmid VR6290 described in section a) above was digested wi& 
EcoRV+AM and the 6.6 kb linear vector fiagment was gel purified and 

25 ligated to the 570 bp N-terminal fiagment Transformed colonies were 

screened for recombinants by PGR using the primer pair NtermPA seqF 5' 
GTGGACOACC AGGAAGTGAT C 3' (SEQ ID NO:33) and NtermPA seqR 
5'0GCTATCTGTCCAGTACAGCTTGAA3'(SEQIDN0:34). A selected 
recQmbinant designated VR6292, was sequenced and was expre^ed in 



ViO 2004/024067 



PCTAIS2003/028199 



-78- 

transiently transfected VM-92 cells in culture (see Rsan^le 6) to coufina the 
expression and secretion of the construct 

EXAMPLE 4 

Construction of Plasmid Constructs Comprising Fragmeiits, Variants, and 
5 Derivatives of aRimanCodon-OptimizedCodingRegionBDCodingJ^ac^^ 

AfUhracisLP 

Several fragments, variants, and derivatives of SEQ ID NO:26, the 
hoxnan codon-optimized coding region encoding Bacillus anthracis UB^ as 
prepared in Example 2, were constructed in the following mami^. Codon- 

10 optimized nucleic acid firagments encoding four alteniate forms of LP were 

constructed, namely, a nucleic acid fragment encoding the jfuU-length mature 
LF in which His 686, £5s 690 and Glu 6S7 have been substituted witii Ala, 
Ala, and Asp, respectively (LP HEXXEl), a nucleic add fragment encoding 
amino acids 34 to S83 of fiill-length LF, encoding domains I-m of matoie LF 

IS (LF D(nnain I-IU), a nucleic add fragment ei:KX>ding amino acids 34 to 254 of 

mature LF, corresponding to domain I of mature LF (LP Domain lAX and a 
nucleic add fragment encoding amino adds 34 to 295 of mabue LF, 
cozrespondmg to domain I of matore LF (LF Domain IB). Each of these 
nncldc add fragments were fiised in-fiame to a nuddc add encoding a 

20 human tissue plasminogm activator (TP A) signal pq>tide sequence tibat directs 

the expressed LF variants and/or fragments to the secretory pathway in 
mammalian cells* Furthermore, otiier useful LF fragments, variants and/or 
derivatives would be readily {parent to those of ordinary skill in the art 

25 a) Construction of TPA-LFHEXXH. 

This construct encodes fiiU length LF (minus the bacterial signal 
sequence) with three point mutations that rmder LF non-toxic. Each of these 
mutations, alone or together, are though to eUminate tiie enzymatic activity of 
LF, tiiereby rendering it non-toxic. See, e.g., Hammond, SB and Haona PC, 
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Jn/ect ImmuTL 55:2374-2378 (1998), wbidi is incorporated herein by 
re&ience in its entirety. Other LP mutants contained in this reference, e.g., 

IjjE«g7C^ jjpB687D^ jj^^ lj^H69QA^ ^ jj^68fiA4«fi9aA ^ j^^j^ ^ 

present invention. While not being bound by theory, substitution of the 

5 histidine residues at positions 686 and 690 is tbou^ to decrease zinc binding, 

resulting in decreased or ik> protease activity, and substitution of the glutamic 
add at position 687 is thought to also eliminate prot^e activity, thereby 
resulting in no fnv^firo or mWvoniaciaphagek^ This construct combmes 
all three mutations to afiford a greater perceived level of safety than either 

10 point mutation alone. 

An expression plasmid coniprising LF HEXXH (Fig. 4, SEQ JD N0:9) 
was prepared in the following maimer. The entire 2418 bp> sequence was 
synfliesizedbyRjetrogenlhc. and inserted into the EcoRV andBamHI sites of 
the TOFO vector as described in Example 3(a). The resulting plasmid was 

15 digested with BcoRV and BamHI and the 2418 bp insert was purified by gel 

electrophoresis as described above. The insert was ligated into 
EcoRV+BamHI digged VR1012 and transformed into E. colL Transformed 
colonies were screened for recombinants by PGR using the primer pair seqFl- 
hLF 5' CCGTGCTCGT TATTCAGAOT 3' (SEQ ID NO:35) and se(^l2-hLF 

20 5' CCTTCTCTTC TOTGCTAAGG 3' (SEQ ID NO:36). A selected 

recombinant, designated VR6295, was sequenced and was expressed in 
transiently transfected VM-92 cells in culture (see Example 6) to confirm the 
expression and secretion of the construct 

25 b) Ckmstruction of TPA-LF Domain I-nL 

This construct encodes the N-terminal amino acids 34-583 of mature 
LF, corresponding to domains I-EH. The entire protease domain (domain IV) 
has been deleted and is therefore non-toxic. See, e.g.^ Paimifer AD et al. 
Nature 414^229-233 (2001), which is incorporated herein by reference in its 
30 entirety. 
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Receat data saggast LF is cepable of entering ceUiB indqpend^y of 
PA \ismg a region at the N tenninal domain (Kai&hner, R et a/. Proa NcUL 
Acad Sci. 100: 6652-7 (2003)), La addition, the full length LF is able to cause 
impaimieiit of dendritic cell function via domain IV protease degradation of 

5 MAP activated protein kinase kinase (MAPKK). Agrswal, A. et al Nature 

424: 329-34 (2003)). Therefore LF, independently of its association with PA, 
may have toxic effects vMcli could be blocked through vacdboatiozL Througji 
flie inclusion of an component in the vaccine of ttie present invention, it 
may be possible to neutralize LP at a number of domains and to block 

10 potential toxidties that occur in conjunction ^th, or independent o^ binding 

toPA In may also be possible to block the primary binding ofLPtX) PA 

An expression plasmid conqnimig TPA-LF Domain I-m (Fig. 5^ SEQ 
ID NO: 13) was prepared in the following manner. The plasmid ViE(6295 (as 
produced in section a) above, was PGR amplified witti tte primer pair TP A- 

15 for (SEQ ID NO:29) and LF-DomH-R 5* GAACCTGOAT CCCTACACCA 

CCTTGGCOTC GATG 3' (SEQ ID NO:37) using Pfu polymerase. The 1740 
bp fragment was gel purified, digested with EcoRV + BamHI and cloned into 
VR1012. Transformed colonies were screened by PGR using the same 
amplification primers. A selected recombinant, designated VR629S2, was 

20 sequenced and was expressed in transimtly transfected VM-92 cells in culture 

(see Exanq;)le 6) to confirm the eiqiression and secretion of the construct 

c) C<mstruction of TPA-LF Domain lA 

This constmct encodes the N-teiminal amino adds 34-254 of mature 
25 LP, corresponding gmorally to domain L This is the portion of LF that 

directly binds PA See^ e.g., Pannifer AD et al. Nature 414:229-233 (2001). 

An expression plasmid comprising TPA-LF Domain I (Fig. 6, SEQ ID 
N0:15) was prepared in the following manner. The plasmid VR6295 (as 
produced in section a) above, was PGR amplified with the primer pair TPA- 
30 for (SEQ ID NO:29) and O-LF-R 5'GGTAATGGAT CCTGAAAATG 
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CCTTGGCGAA CACCT 3' (SEQ ID NO:38) using Pfu polymerase. The 753 
bp fiagment was gel pmified, digested with EcoRV + BamHI and cloned into 
VR1012. TYansfiaimed colonic wctc screeaed by PGR using the same 
amplification primers. A selected recombinant, designated VR6295G, was 
S sequenced and was expressed in transiently transfi^ted VM-92 cells in culture 

(see Example 6) to canfirm flie eaqpression and secretion of the construct 

(Q ConstractionofTPA-LFDoniainlB. 

Ibis construct encodes the N-tenninal amino adds 34-295 of mature 

10 LF, also corresponding gen^^y to domain L 

An expression plasmid comprisiag TPA-LF Domain IB (Fig. 14> SEQ 
ID NO:39) was prepared in the following maimer. The plasmid VR6295 (as 
produced in section a) above, was PGR amplified with tiie primer pair TPA- 
for (SEQ ID NO:29) and crystal-LF-R 5' CCATACGGAT OCTCACTGGT 

15 CTTTCAOTTC CTCCA 3' (SEQ ID N0:41) using Pju polymerase. The 876 

bp J&agmeacrt was gel purified, digested with EcoKV + BamHI and cloned into 
VR1012. Transfonned colonies wore screened by PGR using the same 
anq>]ification primers. A selected recombinant, designated VR62951, was 
sequraced and was expressed in transieDtly transfected VM-92 cells in culture 

20 (see Example 6) to confirm the expression and secretion of the construct 

EXAMPLES 

N-Linked Glycosylation Mutants 

Most mammalian transmembrane and secreted proteins are 
glycosylated post-translationally in the endoplasmic reticulum* See^ e.g., 
25 Lodish H et al Molecular Cell Biology 4* edition, W. H. Freeman and 

Conqmny, New Yodc TbxxQ ato two main types of protein glycosylation in 
mammalian cells, N-linked and 0-linked. N-linked glycos^ation occors on 
asparigine residues at the amino acid motif N-X-(S/T) where X refers to 
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any frmmo 

add i^due and S/T denotes serine or flnreonme. Hiere are seven 
N-linked glycosylstion motife in mature LF, twelve N-linked glycos)dation 
motife in full-lengfh mature PA (PA83) and ten N-liDked glycosjdation motifi; 
in PA63. Since tins glycosylation does not occur in bacteria, the anthrax 

S antig^ synth^ized in mammalian cells after DNA immumzadon xxmy di£Eer 

from the PA and LP in anthrax toxin seoeted by B. onAracis, See^ e.g.^ 
SchafiBa^ CetaL Proieomics 1:248-246 (2001), which is incozporated herein 
by reference in its entiiety. This jnammalion N-linked glyoos^Uction could 
obscore or alter B-cell antibody ^topes tiiat are normally exposed in 

10 conventional anthrax protein vaccines. Therefiire codon-oplimized coding 

regions encoding PA63 acnd LF were made in which fhe asparagines in the 
linked glycosylation motifs (N-X-S/T) moti& weste mutated to glutamines (Q- 
X-S/T). The motif Q-X-S/T is not subject to glycosylation m mammals. Such 
''sugar minns" variants of any of fhe variants, fiagmeaxts, derivatives, or full 

IS length coding regions disclosed herein, as well as addition variants, fiagmmls 

and derivatives known to those of skill in the art are encompassed by the 
present invention. 

All asparagine (N) residues in the N-X-S/T motife contained in TPA- 
PA63 (SEQ ID N0:1, produced as described in Example 3(a)) and TPA- 

20 LFAHEXXH (SEQ ID N0:9, produced as described m Exanq)le 4(a)) were 

mutated to Ghitamine (Q) by gen^ating a s^es of overlapping PGR 
fiagmeaots. As described in more detail below, these fragments woe added 
together two at a tune and amplified with primers at the extreme end of the 
two fiagments to btdld larger and larger PGR fiagments until a full length 

25 mutant was obtained. M each case, the fiill-lengdi fiagment was gel purified, 

digested with EcoRV + BamHI and cloned into VR1012. All PGR reactions 
were performed with Pfii polymerase from Stratagene Inc using standard 
conditions. 
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a) ConstnictionofTPA-sugarmfflu^ 

This cx)cstmct is the same as SEQ ID N0:1, except fliat all teaN-X- 
S/r motifi in fhe encoded polypeptide have beea changed to Q-X-S/r» via 
point mutations. The xnntated construct was assooobledfomo 
fiagments using SEQ ID NO : 1 as Ihe tefnplate. 

Ten nanograms quantities of plasmidVR6290 DNA was anoplified with 
each of the 10 primer pairs listed in Table 8. The fourth column lists the size 
of the resulting PGR products with the various primer pairs. Each of these 
resulting PGR fragments has a single stranded region at each end, which can 
anneal with a single stranded region on another of the fragments. 



TABLES 



PGR 
Bcagment 


Forward Primer 


Bovene Primor 


Size 


1 


TPA-For (SEOIDNO:29) 


PA-Rl (SE0IDNO:42) 


310 bp 


2 


PA-F2 (SE0IDNO:43) 


PA-R2 (SEQIDNO:44) 


140 bp 


3 


PA-F3 (SEOIDNO:45) 


PA-R3 (SE0IDNO:46) 


90 bp 


4 


PA-F4 (SEOIDNO:47) 


PA-R4 (SEQIDNO:48) 


180 bp 


S 


PA-F5 (SE0E>NO:49) 


PA-R5 (SEOIDNO:50) 


260 bp 


6 


PA-F6 (SEQIDNO:51) 


PA-R6 rSEOIDNO:52) 


100 bp 


7 


PA-F7 (SEQIDNO:53) 


PA-R7 (SEOIDNO:54) 


185 bp 


8 


PA-F8 (SEOIDNO:55) 


PA-R8 {SEQIDNO:56) 


150 bp 


9 


PA-F9 (SEOIDNO:57) 


PA-R9 (SEOIDNO:58) 


130 bp 


10 


PA-FIO (SBOIDNO:59) 


PA-RIO (SEQIDNO:60) 


135 bp 


11 


FA-Fll (SEQIDN0:61) 


HPA-endRev (SEQIDNO:31) 


80 bp 



2,5 microliters of each PGR fragment m Table 8 was combined pair 
wise with a second PGR fragment in Table 8. The two fragments were 
allowed to anneal and were used as tenq>lates in a second series of PGR 
reactions, with resulting PGR fragments as shown in Table 9. 



wo 2004/D24O67 



PCT/DS2003/028199 



-84- 



TABLE9 



PGR 
Fzagmfi&t 


Teocplate 
FtEgznents 


Fax¥renl Pximer 


RBvetsePmoBr 


Sfi» 


12 


2 + 3 


PA.F2 fSBOIDNO:43) 


PA-R3 (SBOE>NO:46> 


230 bp 


13 


4 + 5 


PA-F4 (SB01DNO:47) 


PA.R5 (SBOIDNO:50) 


440 bp 


14 


6 + 7 


PA-F6 (SE0IDN0:51) 


PA-R7 (SEOIDNO:54) 


285 bp 


15 


8 + 9 


PA-F8 (SEOIDNO:55) 


PA-R9 (SE01DNO:58) 


280 bp 


16 


10+11 


PA-FIO (SEOIDNasW 


HPA-endRev (S£0 ID N031) 


215 bp 



2.5 nncroliters of each PGR firagmeot in Table 9 was combined pair 
wise with a second KSEl ftagment in Table 9, The two fiagments were 
allowed to anneal and were used as tenq>lates in a fhiid series of PGR 
reactioiis, with tesutdng PGR fiagments as shown in Table 1 0. 



TABLE 10 



PCR 
Fragment 


Tenqilate 
Fiagnierils 


Fotwaxd Pnmer 


Rjovccbo FdmBr 


Size 


17 


1 + 12 


TPA-For (SEQ ID NO:29) 


PA-R3 (SBOroNO:46) 


540 bp 


18 


13+14 


PA-F4 (SEQIDNO:47) 


PA-R7 (SBOIDNO:54) 


725 bp 


19 


15+16 


PA-F8 (SBQIDNO:SS) 


HPA-cmdRev (SEO ID NOiSl) 


495 bp 



Fragments 17» 18, and 19 were ^1 purified before proceeding to the 
next series of PGR reactions. The last two sets of PGR reactions were earned 
out as listed in Table 11^ using 2.5 microliters of the annealed PGR fragment 
pairs listed in the second column, which had b een gel purified. 



TABLE 11 



PGR 
Fragiceat 


FragniBnts 


Forward Primer 


Reverse Primer 


Size 


20 


17+18 


TPA-For (SEO ID NO-^) 


PA-R7 (SEOIDNO:54) 


1265 bp 


21 


19 + 20 


TPA-For (SEQ ID NO:29) 


HPA-endRev (SEQ ID N0:31) 


178Sbp 



Resulting PGR fiagment 21 repieseuts the fuI14engtfa TPA-Sugar 
minus PA63 jBragment (Figure 7, SEQ ID N0:17). The TPA-sugar minus 
PA63 fragment was cloned into the VR1012 expression plasmid. A selected 
recombinant, designated VR6299, was sequmced, and was expressed in 
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transiently transfected VM-92 cells in cultuiie (see Bxample 6) to confixm flie 
expression and secretion of the constmct. 

The sequences of the primers used in the PGR reactions in this 
Example are listed in Table 12. 



TABLE 12 











TPA — fnr 


(rAnrTTGATATCGCCACCATGGATGC 






CTGGAGACACCTGTTTATCGATCC 


43 


PA-F2 


GQATCGATAAACAGGTGTCTCCAG 


44 


PA-R2 


GAAGTACTGGTCTGTTTAQATATGGT 


45 


PA-F3 


ACCATATCTAAACAGACCAGTACTTC 


46 


PA-R3 


CGTCGAGGACTGGCTATTGCTAA 


47 


PA-F4 


TTAGCAATAGCCAGTCCTCGACG 


48 


PA-R4 ' 


GAGGGTCTGCTGnTGCCXLAGG 


49 


PA-F5 


CCTGGGCAAACAGCAGACCCTC 


50 


PA-R5 


CTTCAGACCACTGTGACCCAGTG 


51 


PA-F6 


CACTGGGTCACAGTGGTCTGAAG 


52 


PA-Ri6 


GATCACTGGGCTGCACGGCGG 


53 


PA-F7 


CCGCCGTGCAGCCCAGTGATC 


54 


PA-R7 • 


TATTGGTGGCCTGCAGCTCTGC 


55 


PA-F8 


GCAGAGCTGCAGGCCACCAATA 


56 


PA-R8 


CAGTACTGCTCTGGATAACTTCCC 


57 


PA-F9 


GGGAAGTTATCCAGAGCAGTACTG 


58 


PA-R9 


AAGCTGGAAATCTGCAGCATATCAT 


59 


PA-FIO 


ATGATATGCTGCAGATTTCCAGCTT 


60 


PA-RIO 


CTCGCTTGGCTGGATGATTGTGT 


61 


PA-Fll 


ACACAATCATCCAGCCAAGCGAG 


31 


HPA-eadRev 


CTTGAAGGATCCTCAACCGATCTCGTAG 



b) Onistraction of TPA-sugar minus IJFHEX^ 

This construct is the same as SEQ ID N0:9» except that all seven N-X- 
SfT motifs in the encoded polypeptide have been changed to QrXSfT^ via 
point mutations. The mutated constmct was assembled fix)moverlqq[iii^PC3l 
fiagments using standard methods, using primers which code for Q residues 
instead of N residues in the seven glycoff^lation moti&^ and using SEQ ID 
N0:9astibiets(Z9late. 
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Tea nanograms quantities of plasimdVR6295 DNA was anq>lified witti 
each of the 8 primer pairs Usted in Table 13. The fourth cohmm lists the size 
of the resulting PGR products with the various primer pairs. Bach of th^ 
resulting PGR fir^meuts has a single stranded region at ^h ad, which can 
azmea] with a single stranded region on another of the fiagments. 



TABLE 13 



PGR 


ForwBid PiiiML 


Reverse Pttxncr 


Size 


1 


TPA-For (SEQK>NO:29) 


LP.Rl (SEOIDNO:62) 


170 bp 


2 


IJ?-F2 (SEQroNO:63) 


LF-R2 (SEQIDNOtW 


4S0bp 


3 


LF-F3 (SEQroNO:65) 


LF-R3 (SEOIDNO:66) 


220 bp 


4 


LF.F4 (SEQIDNO:67) 


LF-R.4 (SEOIDNO:68) 


580 bp 


5 


LF-F5 {SBOIDNO:69) 


Ii?-R5 fSEOIDNO:70) 


700 bp 


6 


LF-F6 (SE0mN0:71) 


LF'-R6 fSBOIDNO:72) 


70 bp 


7 


LF-F7 fSEOIDNO:73) 


Ii?-R7 (SEOIDNO:74) 


65 bp 


8 


LF-F8 (S£0IDNO:7S) 


HLPend-R (SEOIDNO:76) 


165 bp 



2,5 microliters of each PGR fragment in Table 13 was combined pair 
wise with a second PGR fragment in Table 13. The two fiugmeats were 
allowed to anneal and were used as templates in a second series of PGR 
reactions, with resulting PGR fragments as shown in Table 14. 



TABLE 14 



PGR 
FcagiDeiit 


Ten^iatB 
FrBgmdits 


For wtud Piimcx 


BfivenePiima 


Si2» 


9 


1+2 


TPA-Fw(SEQIDNO:29) 


IF-R2 (SEOIDNO:64) 


620 bp 


10 


3 + 4 


LF-F3 (SEQIDNO:65) 


LF-R4 (SEOIDNO:68) 


800 bp 


11 


5+6 


LF-FS (SBQIDNO:69) 


LF-R6 (SEOn>NO:72) 


770 bp 


12 


7 + 8 


LF-F7 (SEQIDNO:73) 


HLFend-R (SEOIDNO:76) 


230 Iv 



2.5 microliters of each PGR fragment in Table 14 was combined pair 
wise with a second PGR fragment in Table 14. The two fragments were 
allowed to anneal and were used as tenoiplates in a third series of PGR 
leactions, with resulting PGR fragments as shown in Table 15. 
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TABLBIS 





Ternplate 
FragmcntB 


PocwBid Primer 


R0VKB6 Pximflr 


Size 


13 


9+10 


1?A-For(SEQIDNO:29) 


LP-R4 (SEQIDNO:68) 


1420 bp 


14 


11 + 12 


U-F5 (SBQIDNO:69) 


HUendpR (SBQn>KO:76) 


1000 bp 



Fragments 13 and 14 were gel purified before prooeeding to the final 
PGR reaction. This PGR reaction was carried out as listed in Table 16, using 
2.5 microliters of the annealed PGR fiagment pairs listed in the second 
oohimn» ^ch had been gel purified. 



TABLE 16 



PGR 

Fratrmffnt 


Template 


Fonraid Primer 


Bsvene PmuBT 


Size 


15 


13 + 14 


TPA-Fot(SE0IDNOJ9) 


HUPend-R CSBOn}NO:76) 


2418 bp 



Resulting PGR fiagmmt IS lepresente Ihe fidl-length TPA-Sugpr 
minus LF HEXXH ficagment (Figure 8, SEQ ID N0:19). The TPA-sugar 
minus LF HEXXH fragment was cloned into the VRl 012 expression plasmid. 
A selected recombinant, d^ignated VR6300, was sequenced and was 
expressed in transiently transfected VM-92 cells in culture (see Example 6) to 
coxxEinn the expression and secretion of the construct. 

The sequences of the primers used in the PGR reactions in this 
Example axe listed in Table 17. 



TABLE 17 



SEOroNO: 


Primer 


Sequence 


29 


TP A -for 


GAGCTTGATATCGCCACCATGGATGC 


62 


DF-Rl 


TCCTGTGTTITCTGACXnTCTTCG 


63 


LF-F2 


CGAAGAACGTCAGAAAACACAGGA 


64 


LP-R2 


TATCroACGCCTGTrTGATTGTGTT 


65 


LF-F3 


AACACAATCAAACAGGCGTCAGATA 


66 


LF-R3 


CCAQAGACAGCTGAATCTCCTGTT 


67 


LF-F4 


AACAGGAGATTCAGCTGTCTCTGG 


68 


LF-R.4 


AGCGGTGAGCTGGITAATATTCATG 
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SEOroNO: 


Primer 


Sequence 


69 


LF-F5 


CATGAATATTAACCAGCTCACCXSCT 


70 


LP-R5 


CCTCAGAATCCTGTCGAAGCTCA 


71 


LF-F6 


TGAGCTTCGACAGGATTCTGAGG 


72 


LF-R6 


GATCAGACTGCTGCTTATCCAACA 


73 


LF-F7 


TGTTGGATAAGCAGCAGTCTGATC 


74 


LF-R7 


AGGAAGTCAGCTGACTCCCTTCC 


75 


LF-F8 


GGAAGGGAGTCAGCTGACTTCCT 


76 


HUFend-R 


GCAGATCTGGATCCTCAAGAG 



EXAMPLE 6 

In vitro Expression of Human Codon-Optimized Coding Regions Encoding B, 
Amhracis PA and LF» and Fragments, Variants and Derivatives thereoJ^ in a 
5 Murine Cell line 

The expression plasmids described Bxanqples 3-5 above and the 
coneqKmding wild type BaciBus anihracis genes wece initially analyzed in in 
vitro transiently transfected cells in cuftme. Initial studies were carried out in a 
wdi diaracterized mouse melanoma cell Hne (VM-92, also known as UM- 

10 449)» using cationic Iqpid-based ttansfection procedm:e5 well known to those 

of skill in the art Other standard cell lines, for example, COS-1 cells, COS -7 
cells, CHO cells, HEK-293 cells, and HeLa cells, may be used for transient 
ttansfections as weU. Following transfection, cell lysates and culture 
siq>ematants of transfected cells were evaluate to compare relative levels of 

IS ^pre^on of B. anthracis antigen proteins. The samples were assayed by 

western Wots and ELISAs, using commercially available anti-PA and Anti-LF 
monoclonal antibodies (available from Research Diagnostics Inc., Flanders 
NJ)^ so as to compare both the quality and the quantity of expressed antigen. 
Additionally, in vitro transfection assays were used to deterarine the effect of 

20 mixing the various plasmids con^msing codon-optimized coding regions 

encoding non-toxic PA and LF on levels of expression in mammalian cells. 

Expr^sion products derived fix)m cdls transfected with tibie various 
polynucleotide constructs are examined to ensure the coxrect or predicted 
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molecular wdght of the recomhmant antigens, and imminioreactivity of the 
recombinant antigens (le,, to react wifh B. anthracis antism). In addition, a 
conqiarison of expression levels (both intra- and extra-cellular) of each class 
of expression plaamid (e.g., wild ^e vs. human codon-optimized; truncated 
vs. foil-length) is made. 

EXAMPLE? 

In vitro Expr^on of Human Codon-Optimized Coding Regions aicoding B. 
AnthmdsfAdMlFtaMFxHSiDaG^ Variants andDerivatives Thcareo^in a 

Human Cell line 

The expression plasmids described Exan^les 3-5 above and the 
corresponding wild type Bacillus anthracis genes are also analyzed in in vitro 
transfected human cells in culture. These studies are carried out in a well 
characterized human cell line, e.g., HeLa cells, ATCC Accession No. CCL-2, 
available fiom the American Type Culture Collection, Manassas, VA, using 
cationic lipid-based transfection procedures well known to those of skill in the 
art Following transfection^ cell lysatBs and cnltune ggpmiatants of transfected 
cells are evaluated to cooQiare relative levels of expr^sion of j5, anthracis 
antigen proteins. The san[q)les are assayed by western blots and EUSAs, 
using commercially available antiPA and Anti-LF monoclonal antibodies 
(available fiom Research Diagnostics Mc, Flanders NJ), so as to con^are 
both the quality and the quantity of expressed antigoi. Additionally, In vitro 
transfection assays are used to determine the effect of mixing the various 
plasmids conqmsing codon-optimized coding regions encoding non-to7dc PA 
and LF on levels of expression inhuman cdls. 

Expression products fiom flie derived fiom human cells transfected 
with Ihe various polynucleotide constructs are examined for molecular weight, 
and expression immunorcactive antigens (i.e., to react with A anthracis 
antisei:a). In addition, a comparison of e]q>ression levels (both intra- and 
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extra-cellular) of each class of expression plasmid {cg.t wild type vs. human 
codon-optunized; tnincated vs. fall-length) is made. 

EXAMPLES 

Animal ImmuiiizatioiL and Challenge 

5 The immunogenidty of esqiression products encoded by the codon- 

optmiized polynucleotides described in Exanq>les 1-5 are evaluated based on 
eai^plasmid's ability to nK>unt a humoidunmi^ Flasmids 
are tested individually and in combinationB by injecting smgle constmcts as 
well as multiple constmcts in various animals as described below. 

10 Immunizadons are initially carried out in mice by intramuscular (M) 

injections. Serum is collected from immunized animals, and the immune 
response is quantitated by EUSA assay using commercially available antiP A 
and And-LF monoclonal antibodies (available from Research Diagnostics Inc., 
Flanders NJ) according to standard protocols. The tests of immunogenicity 

IS iurfh^ include measuring antibody titer, neutralizing antibody titer^ and 

challenging immunized ^m'tnalB with toxin protein. 

Testing in rabbits are then used to confirm the results in mice and 
thereby provide efficacy data for the best plasmids in more than one 
mammaltm immunogenidty model systeoL Serum is collected fixmi 

20 inununized rabbits^ and antibody titers and neiitralizmg antibody titers are 

determined, hi addition, inununized rabUts are tested widi a spore inhalation 
challenge. The combined results detemune the plasmids to be subsequently 
tested in non-human primates. 

25 a) Mouse immunizations. 

The plasmid constructs described in Bxanq)les 3-S, as well as similar 
plasmid constructs conqmsing native coding regions encoding native PA and 
LF, as well as empty control plasmids, are tested in vivo in mice by 
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mtramuscular injection of the rectus femoiis muscle within the quadricq>s, 
using methods described above. There are 5-10 animals per groiq). A 
standard DNA vaccination protocol is used (50 DNA in 150 mM sodium 
phosphate (1 mgM)/leg at 0, 14 and 28 days). Alternative DNA formulations 

5 include PBS instead of sodium phosphate, adjuvants, e.g., Vaxfectm™ at a 4:1 

DNA: Vaxfectin™ miass ratio, mono-phosphor^ lipid A (detoxified 
endoto3dn) from jSI mrmesota (MPL) and trefaalosedicorynQmycolateAF 
(TDM), in 2% oil (8qualene)-Tween 80-water (MPL + TDM, available fiom 
Sigma/Aldrich, St Louis, MO, (catalog # M6536)), a solubilized mono- 

10 phosphoryl Kpid A fbnmilation (AF, available fiom Corixa), (±)-N-p- 

Ac€toxypixy}4>NJ^-dimethyl-2,3-bis(octyloxy>l-propanamm^ chloiide 
(compound # VC1240), or poloxamers, CRL1005 (fiom QigamchCTi) 
and a solution of benzyl-alkomum diloride "BAK" (from Ruger 
Chemicals)("CRL 1005/BAK") {see Shriver, J.W. et al.. Nature ¥i J:331-335 

15 (2002), and P.C.T, Publication No. WO 02/00844 A2, each of which is 

incorporated herein by reference in its entirety); or transfection-fiunlitating 
cationic lipids, e.g;, DMKIE/DOPE at a 4:1 DNA:lipid mass ratio. 

Serum samples for antibody assays are taken at 0, 21, and 41 days. On 
OF about day 42, the vacdnated animals are challenged using either a tail vein 

20 injection of purified lethal &ctor to3dn (Letx) or pijhnonary delivery of 

aerosolized B, anthracls. Mice are challenged using the purified B. anthrads 
lethal toxin (Lebc), the combined mature PA6S and LP piotenis. These 
protems are provided thiou|^ a collaborative agreement wifli Dr. Stq^hen 
Leppla, National Institutes of Dental Research, at the MH. The proteins axe 

25 Gsqpressed in E,coli as recombinant proteins and purified according to 

pribliflhed protocols (jee, e.g., Leppla, SH Methods EnzymoL 755:103-116 
(1988) and Park, S and Leppla, SH Protein &pr. Purif. ift293-3Q2 (2000), 
each of iTvbich are incorporBled hoein by reference in fheir entireties). The 
challenge is conducted by iitjectmg the mouse tail vein with a protein cocktail 

30 containing 60 [x% of purified PA and 25-30 ^g of purified LF. This 
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^>pxoxiniat^ the equivaleat of five 50% letbal dos^ of Letx. The animals are 
momtored for morbidity and moitahty at regular intervals following challeoge. 

b) JElabbit immuoizations. 

S The rabbit has increasingly gained accqptance as a relevant animal 

model to evaluate efficacy of vaccines against B. anihracis. The plasmid 
constmcts described in Example as well as similar plasmid constructs 
conqrrising native coding regions ^coding native PA and LF» as well as 
enqity control plasmids, are tested in vivo in rabbits by the following method. 

10 Plasmid vaccination of labbits is done at four-week intervals. At each time 

point, the animals (i:rf-4) receive an IM iiyection (quadriceps) of 500 jAg 
(ImgM) of DNA in 150 mM sodium phosphate formulated with the adjuvant 
Vaxfectin™ at a 4:1 DNAiVaxfectin^ mass ratio, each animal receiving a 
total of 3 injections (1500 fig/animal). Alternative DNA fonnulations include 

15 other adjuvants as described haein» for exanq)le, CRL1005/BAK {see Sbriver, 

LW. et at. Nature 475:331-335 (2002), and P.CT. PubHcation No. WO 
02/00844 A2), and/or trans&ction-faciHtatmg cationic lipids, e.g., 
DMRIE/DOPB at a 4:1 DNArHpid mass ratio. Serum san^les are taken at 
Day 0, 42, and 69 to detemiine antibody titers. The animals recdve an 

20 aerosolized challenge on Day 70. 

Rabbits are challenged in a BSL-3 facility (available, for eKanQ>le, at 
the Battelle Medical Research Bvaluation Facility (MREF) in West JefBsrson, 
OH) by standard mefliods. See, e.^,, Henderson, DW J. Hygiene 50:53-68 
(1952)). The Battelle facility has the equ^ment, stafi^ and certification to 

25 safely conduct a aerosol challenge of large mammalfi using infectious and 

toxin producing B. anthracis. Vaccinated animals are transferred to Battelle' s 
feciUty in West Jefiferson, and then, after a lACUC proved holding isolation 
period, the animals are challraged with between 50 and 100 LD50 aerosolized 
B. anthracis spores by inhalation. The animals are nu>mtored for morbidity 

30 and mortality at regular intervals following challmge. 
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c) Non-huznan primate immimizations. 

The pksmid constnicts desodbed in B3ca2iq>les 3-5, as well as smiilar 
plasDsid constmcts con^pnsmg native coding regions encoding native PA and 

S LF, as well as enq>ty control plasmids, are tested in vivo in non-fauman 

primates by die following mcdiod. Cynomolgus macaques (M fasdadarbs) 
are used for immunization and challenge experiments. Plasmid vaccination of 
die macaques is done at fbur-wedc intervals. Animals receive 1 to l.S mg 
eadbi of DNA at each immunization bilaterally (2 to 3 mg total) 

10 intramuscularly, in the deltoid muscle. Following immunization, all animals 

are challenged by puhnonary deliv^ of aerosolized A anthracis, 

(Q Human immunizatioiis. 

The plasmid constmcts described in Examples 3-5, as well as similar 
15 plasmid constmcts con^rising native coding regions encoding native PA and 

LF, as well as etnpty control plasmids, are tested in vivo in healthy human 
. volunteers by the following method. The plasmids are formulated in 150 mM 
sodium phosphate, optionally including Vaxfectm^ at a 4:1 DNA: 
Vaxfectin™ mass' ratio, and or a poloxamer, e,g., 0.01% (w/v) Pluionic® R 
20 25R2, Vaccinations are given at 0, 4, and 8 weeks intramuscularly into the 

deltoid muscle eiih^ by needle injection or by needleless Biojector (see^ 
ftg., Wang, R,etal. Proa NaO. Acad. Sci. USA P&iq817-10822 (2001)). The 
vohmteers receive 1 to l.S mg each of DNA at each immunization. Following 
immunization, soum spedmens are collected fiom die volunteers and tested 
25 for antibodies to B. anthracis LP or PA. 

e) Laboratoiy animal, conipanion animal, or fi>od animal immunizations. 

Plasmid constructs such as those described in Examples 3-5, are 
prepared using codon-optimized coding regions optimized &r the species of 
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interest using an appropriate codon-u^e table, eg.. Table 3 (mouse), Table 4 
(domestic cat), or Table 5 (cow). Codon optimization may be carried out by 
using relative frequencies for the codons, or by using the most frequent codon, 
as described herein. Plasmids conqnising these coding regions, plus similar 
S plasmid constructs comprising native coding regions encoding native PA and 

IF, as. well as empty control plasmids, are tested in vivo in various animal 

r 

spedes by the following mdiiod Hie animal species of interest is imTminized 
with an ^ropriate amount of a DNA vaccine codon-K)ptimiz;ed for that 
species, at an iqypropriate amoun:^ defivered in an appropriate route for that 

10 spedesi including, but not limited to tiie following immunization strategies: 

for mouse immonization^ intramuscular delivery into the rectus femoiis 
muKle of 50 ^g DNA in ISO mM sodium phosphate (1 mg/miyi% at 0, 14 
and 28 days; for cow inununizalion, intradermal delivery into the ear of SCO 
)ig DNA in normal saline (1 mgi^) at days 0 and 21 (see, e.g, van Drunen 

15 Little-van den Huik et al. X Gen, Virol 7P:831-839 (1998)); and for domestic 

cat immunization, intradermal delivery of 300 |Jig DNA in normal saline (1 
mg/m^ at days 0, IS, and 30 {see, eg., Osorio, JE, et al Vaccine 77:1109- 
1116(1999)). 

EXAMPLB9 

20 Immunological Asays 

a) ELISA for LP and FA Antibody Titers. 

Microtiter plates are coated with either PA or LP antigen by incubating 
100 ngAveH of purified protdn (obtained from list Biological Laboratories, 
Campbell, CA) ovemigjit at 4% in 100 mM caibonate bufifer, pH 9.6. The 
25 wells are washed (3X) wilfa 10 mM Tiis-buffered (pH 7.3); ISO mM NaQ 

(TBS) followed by a 1% (w/v) BSA block. Serially diluted experimental and 
control SOTun san^les in TBS + 0.05% Tween are added to the wells and 
incubated for 60 min at room tenqpecatore. Bozyme conjugated O^nseradish 
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peroxidase or alkaline phosphatase) anti^mousey axiti-rabbit, or anti-moiikey 
IgO are then added to each well and supematants monitored for enzyme 
prodnd Antibody titers are defined as the hig^^ dihition of a serum sample 
that results in an absorhance value 2X greater than that of a non-immune 
S control serum. Antibody quantification will he determined using a purified 

anti-PA and anti-LF Isl^l and IgG2 reagrat antibody. 

b) Toxin Neutralization Assay. 

Antibodies fiom vaccinated animals are initially tested using an in 
10 vitro assay that measures the neutralization of lethal toxin (Letx, Le., IP and 

PA protein) cytotoxicity. Briefly, this protection assay is carried out using 24 
hr. cultures of J774A.1 mouse macrophage cells maintained in microtiter 
platM (-6 X 10 '^cells per well) in DME media, with glucose and L ghxtamine 
s\9plements, and 7% fetal bovine serum at Serially diluted serum from 
IS vaccinated and control animals are mixed with letx and allowed to sit for 60 

muL The final Letx c(mcentratian will be brou^t to 3 pg^l. This mixture 
will then added to file J774A.1 cells and incubated for seven hours at 3TC. 
FinaUy» 100 ixl of 0.5 mgfwl 3-[4»S-dimefhyltfaiazol-2yl]-2,S- 
diphenyUetcazolium bromide (MTI) is added to each e^q^edmental well and 
20 allowed to incubate another 60 rnm. befbro assaying fi>r cytotoxicity. In this 

assay, surviving cells metabolize MIT into an insoluble purple pigpimt in a 
manner fiiat is proportional to viability. This msohible pigment is recovered 
fix>m viable cells and quandtated by absorption of 4S0nm light 

EXAMPLE 10 

25 Immunization using a Prime-Boost Strategy 

There is accumulating evidence to suggest ibst a naked DNA prime 
with a heterologous viral or protein boost will result in an enhanced Immoral 
response. Since the humoral response is widely believed to be tihe immune 
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correlate of protection against B. (mthraciSy in certain experimeiits a prime- 
boost strategy is used. The boost may be purified non-toxic LF and/or PA 
protein or the commercially available AVA vaccine. Alternatively 
recombinant virus vectors, eig., adenovirus vectors, expressing non-toxic LF 

S and/or PA may be used as the boost. Results are evaluated to compare 

antibody titers resulting from prime-boost immunization relative to DNA 
vaccination alone. 

New Zealand rabbits are immunized wilh a series of three plasmid 
irgections or two plasmid ii^ections with the plasmid constructs described in 

10 Examples 3-S, as well as similar plasmid constructs conqirising native coding 

regions encoding native PA and LF, as well as eEzq>ty control plasmids, 
followed by a singile dose of recombinant PA and/or LF protein (1 microgram 
in Alhydrogel) or the AVA vaccine (5 microliters). Controls include 
immunization wilfa the codon optimized and control plasmid constmcts alone, 

15 and mock immunizations. Following the iinmuiiization series^ ag., two 

plasmid DNA immunizations at four week intervals followed by a boost at 
week 12, total antibody titers and neutralizing titers are determined. In 
addition, selected inmiimized animals are challenged with a SOQx LDSO dose 
of aeiosoUzed anthrax spores at Battelle Medical Research Evaluation Facility 

20 in West Jefferson> OH as described in Example 8. 

EXAMPLE 11 

Immunization of Mice Using Codon-Qptimized B. anthracis DNA Vaccines 

a) E^eriment 1 

Six groups (Groups A-F) of 5 Balb/c female mice were hgected 
25 bilaterally in the rectus femoris muscle with SO pi of DNA solution (at 1.0 

m^ml) (100 pi total/mouse), on days 1 and 21 and 42 witii each of the 
followmg plasmids: 
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Group lA: VR6290 CrPA-PA63, Hg. 1. SEQ ID N0:1, prepared as 
described in Bxamplt 3a); 

Group IB: VR6291 CEPA-PA63AFF, Fi& 2, SEQ ID N0:5, prepared 
as described in Example 3b); 

5 Group IC: VR6292 (TPA-PA83Afiirin, Fig. 3, SEQ ID N0:7, prepared 

as described in Exa]I^>le 3c); 

Group ID: VR6295 (TPA-LF HEXXH, Fig. 4, SEQ ID N0:9, 
prq)ared as described in ^cample 4a); 

QtGup IE: VR6290 (50 jig) +VR6295 (50 [ig), co-injected; and 

10 Qrovp IF: VR1012 (empty expression vector). 

The plasEoids listed above were formnulated as follows. Onb vial (0.5 
mg) of MPL+TDM adjuvant^ purchased fiom Sigma/Aldrich (catalog # 
M6S36) was resuspended in 150 mM Na2P04 according to xnanufiu^turen^ 
IS instructions. Fifiy microliters of DNA solution was mixed 1:1 (vAr) widi the 

MPL+IDM emulsion and injected into each moose at the times specified 
above. 

Mice were bled for serum on days 0 (prebleed) , 20 (bleed 1), and 41 
(bleed 2), and 62 (bleed 3). PA antibodies were measured in each of Groups 

20 ' lA-lC, IE, and IF, UP antibodies were measure! in each of Groups ID, IE, 
and IF, and LT neutralizing antibodies were measured in each of Groups 1 A- 
IE. All assays were done as outlined in Example 9. The geometric mean of the 
anti-PA and anti-LF titers were calculated following each bleed. The results 
are shown in Figs* 15A and 1 SB, respectively. In Fig. 15C, the serum fiom 

25 each mouse was tested for LT neutralizing antibody titer after the last DNA 

immunization (bleed 3) according to tiie procedure in Example 9. The mean 
neutralizing tit^ for each group of mice was calculated and plotted and tiie 
error bars represent one standard deviation fiom the mean, 
b) Experiment 2 

30 Ei^ gioups of 5 mice each (Groups lA-TH) were iitjected bilaterally 

in ti» rectus fiamoris witix 50 pi (50 ^g) of DNA solution (100 \il (100 ^g) 
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total per mouse), adiuvanted with^dPL + TDM as described in section lla» on 
days 1, 21, and 49 with the following combinations of plasmids: 

Giox^ 2A: VR-6290 (50 ^lg) + VR-1012 (50 |ig); 

Group 2B: VR-6291 (50 fig) + VR-1012 (50 ng); 
5 Gioiip 2C: VR-6292 (50 ^g) + VR-1012 (50 jig); 

Group 2D: VR-6295 (50 |ig) + VR-1012 (50 jig); 

Group 2E: VR-6290 (50 \ig) + 50 jig VR-6295 

Groi^ 2F: VR^91 (50 fig) + VR-6295 (50 jig); 

Group 2G: VR^292 (50 ^g) + VR-6295 (50 jjfi); and 
10 Group 2H: VR-1012 (100 jig). 

Mice were bled for serrnn on days 0 (prebleed) , 20 (bleed 1), and 41 
(bleed 2), and 62 (bleed 3). PA antibodies were measured in each of Groups 
2A-2C and 2E-2H, LF antibodies were measured in each of Groups 2D-2H, 

15 and LT neutralizing antibodies were measured in each of Groirps 2A-2G. All 

assays were done as outlined in Bxan^le 9. The geometric mean of the anti- 
PA and anti-LF titers were calculated following each bleed. The results are 
shown in Figs. 16A and 16B, respectively. In Fig. 16C, the serum fiom each 
mouse was tested for LT neutraliring antibody titer after the last DNA 

20 immunization (bleed 3) according to the procedure in Bswaple 9. The mean 

neutrafizing titer for each gfoup of mice was calculated and plotted and the 
error bars represent one standard deviation fiom the mean, 
c) Experimoit 3 

Four groups of 5 mice each (Groups 3 A-3D) were injected bilaterally 
25 in the rectus femoris with 50 (il (50 jig) of DNA solution (100 yH (100 jig) 

total per mouse)^ adjuvantcd with MPL + TDM as d^ribed in section 1 la, on 
days 1, 21, and 49 with the following combinations of plasmids: 

Group 3A: VR-6292 (50 ^g) + VR-1012 (50 fig); 
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Group 3B: VR-6292 (50 jig) + VR-62952 (50 ^g, TPA-LF Domain I- 
in, Fig. 5, SEQ ID N0:13, prepared as described in Exan^le 4b); 

Qto^xp 3C: VR-6292 (50 jig) + VR-62951 (50 \xg, TPA-UF Domain B, 
Fi& 14, SEQ ID NO:39, prepared as described in Example 4d); and 

5 Group 3D: VR-6299 (50 fig, TPA-Sugar minus PA63, Figure 7, SEQ 

ID NO:17, prepared as described in Example 5a) + VR-1012 (50 jig). 

i 

Mice were bled for serum on days 0 (prebleed) , 20 (bleed 1), and 41 
(bleed 2% and 62 (bleed 3). PA antibodies were measured in each of Gioi^s 

10 3A-3D, LF antibodies were measured in each of Groups 3B and 3C, and LT 

neutrali^dng antibodies wore measured in each of Groins 3A-3D. All assays 
vrm done as outlined in ^cample 9. The geometric mean of the antt-PA and 
anti-LF titers were calculated following each bleed. Hie results are shown in 
Figs. 17A and 17B, respectively. In Fig. 17C, the serum fiom each mouse was 

IS tested for LT neutralizing antibody titer after the last DNA immunization 

(bleed 3) according to the procedure in Example 9. The mean neutralizing 
titer for each gicovp of mice was calculated and plotted and the oror bars 
represent one standard deviation fiom the mean, 
d) E]q)eriment4 

20 Four groiq>s of 10 mice each (Choi:^ 4A-4D) were injected bilaterally 

in the rectus femoris with 50 )j1 (50 ^g) of of plasmid VR.-6292 (100 }il (100 
^g) total per mouse), fiimmlated with various adjuvants, on days 1, 21, and 
49, as follows: 

Group 4A: VR-6292 formulated wifli CRL 1005/BAK; 

25 Group 4B : VR-6292 formulated with MPL + TDM, as described in 

section lla, ji(pra; 

Groiq) 4C: VR-6292 foimulatedwithVaxfectm™ata4:l DNA: 
Vaxfectin™ mass ratio; and 

Group 4D: VR-6292 formulated with DMRIBJK)PE (1 :1 molar ratio) 
30 at a 4:1 DNAdipid mass ratio. 
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The plasmids in Qxoup 4A were fonnulated as follows. Hie poloxamCT 
CRL1005 (fix)m Organichotn) and a solution of b^izyl-alkonium chloride 
"BAK" (fix>in Ri^er Chemicals) were added sequentially to plasmid solutions 
in PBS* The initial plasmid/poloxamer formulation was prepared to contain 5 
5 mgtoL plasmid DNA, 7.5 mg/mL(3lL1005ai^ 03 mM These initial 

preparations were diluted 1:1 (volivol) with PBS, thm cold sterile J61ta:ed. 
Further dilution with steiile PBS wasdone just prior to use to provide the final 
woddng concmtration of 1 mgfmL pDNA, 1.S mg^ QULIOOS aod 0.06 
mMBAK. 

10 Mice were bled for serum on days 0 ^rebleed) , 20 (bleed 1), and 41 

(bleed 2), and 62 (bleed 3). PA antibodies were measured m each of Groups 
4A-4D» andLT neutralizing antibodies were measured in each of Groups 4A* 
4D. All assays were done as outlined in Bxample 9. Hie geometric mean of 
the aiiti-PAtitei:swa:e calculated following each U The results are shown 

15 in Hg. ISA In Fig. 18B, the serum fiom each mouse was tested foi LT 

neutralizing antibody titer after the last DNA immunization (bleed 3) 
according to the procedure in Example 9. The mean neutralizing titer for each 
ffoxsp of mice was calculated and plotted and &e error bars represent one 
standard deviation &om the mean. 

20 e) Experiments 

Six groups of 10 mice each (Groi^ SA-SF) w^ injected bilaterally in 
the rectus femoris with 50 pi (50 jig) of of plasmid VR-6292 (100 fil (100 \ig) 
total per mouse), formulated with various acyuvants, on days 1, 14» and 28, as 
follows: 

25 Group 5A: VR-6292 fi>rmulated withMPL + TDM, as described in 

section 11a, siipra\ 

Group 5B: VR-6292 formulated with MPL-A aqueous 1000 pg/^mL 
(Corixa) mixed 1 : 1 (vAr) with DNA; 

Groiq> 5C: VR-6292 formulated with CRL 1005/BAK, as described in 
30 section lid, stfpra; 
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Groi5)5D: m-6292fonniilatedwifhVaxfectm^ata4:lDN^ 
Vaxfecdn™ mass ratio; 

Qsovp SB: VR-6292 formulated wiiiiDMRIEaX)PE (1:1 molar ratio) 
at a 4:1 DNA:ljpid mass ratio; and 

Group 5F: VR-6292 formulated wifli IX MS. 

Mice we bled for serum on days 0 ^rebleed) , 13 (bleed 1), and 27 
(bleed 2\ and 56 (bleed 3). PA antibodies were measured m each of Groups 
SA-5F after each bleed, and LT neutralizing antibodies vrm measured in each 
of Gioiq»s SA-5F after bleed 3. AH assays were done as outlined in Bxanxple 9. 
The geometric mean of the antt-PA titers were calculated following each 
bleed. The results are shown m Fig. 20. In Fig. 21, die seram fix>m eadi 
mouse was tested for LT neutralizing antibody titer after the last DNA 
inrniunization (bleed 3) according to the procedure in Example 9. The mean 
neutralizing titer for each group of mice was calculated and plotted and the 
error bars represent one standard deviation fiom the mean. 



EXAMPLE 12 

Immunization of Rabbits Using Codon-Optimized A anihrads DNA Vaccines 

Twelve (12) groups of 10 rabbits each (Groups A-O and I-M, for DNA 
vaccinaticms) and one group of 4 rabbits (Group H, for Ihe AVA vaccination) 
(pryctolagids cunicubis. New Zealand albino rabbits, 2-S kg each at onset of 
treatment) were used in this expedement The rabbits in Groups A-G and I-M 
received a 500 ^g intramuscular injection in each quadricep muscle (bilateral) 
for a total of 1 mg of plasmid DNA pesc rabbit per immunization. Injection of 
the formulated plasmid DNA took place on days 0, 28, and 56. Some animals 
received only the first two plasmids injections on days 0 and 28 (denoted 2 
injs in Fig. 19). All rabbits w^e prebled two days before the first 
inrniunization and bled again on days 14^42, and 70. 
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Unl^ noted, the various formulatioDs were administered by a bilateral 
intramuscular injection into the qtiadriccps muscles on Days 0, 28, and 56 with 
a needle* Tlie dose volume to be administered is 500 ^U/muscle, 1 naVanimaL 
The rabbits in (koxsp D were vaccinated uang a Biojector, as follows. 
5 Animals wero anesthetized using ketamine/^cylazina The skin over die 

injection site was shaved* and the dose volume administered was 500 
p]/nniscle^ 1 nd/aninQial. The vaccination g;roups were as follows: 

Group A: VI16292 formulated with Vaxfectin™ at a 4: IDNA: 
Vaxfectm™ ratio; 

10 Group B: VR6292 (500 ^g) + VR-62952 (500 pg) formulated wiA 

Vaxfectin™ at a 4:1 DNA: Vaxfectin™ ratio; 

Group C: \ai6292 formulated with DMRffi/DOPB at a 4:1 
DNAdiqpid ratio; 

GroupD: VR6292 formulated wiihVaxfectin™ at a 4:1 DNA: 
15 Vax&ctin™ ratio, delivered by Biojector, 

Qtonp E: VR6292 (500 jig) + VR-62951 (500 jig) formulated with 
Vaxfectin™ at a 4:1 DNA: Vaxfbctin™ ratio; 

QraapV: VR6290£brmulatedwilhVaxfectin™at a4:l DNA: 
Vaxfectin™ ratio; 

20 Groi^G: VRfi292 formulated with Vaxfectin^ at a 4:1 DNA: 

Vaxfectin™ ratio (two iiyections only); 

Groiq)H: 0>nmie(rdal anthrax vaccine AVA, 50 fil, delivered on day 
28 and 56 by a single IM inj ection; 

Gro\9l: VR-^2951fonnulatedwi1faVaxfectin™ata4:lDNA: 
25 Vaxfectin™ ratio; 

Group J: VR6292 (500 jig) + VR-62951 (500 pg) formulated with 
Vaxfectin™ at a 4:1 DNA: Vaxfectin™ ratio (two injections only); 

GroqpK: VR-62952 formulated witti Vaxfectin™ at a 4:1 DNA: 
Vaxfectin™ ratio; 

30 Gro^pL: VR6292 formulated T^tfaMPL-Aaqoeous 1000 ^g^mL 

(Onixa) mixed 1:1 (v/v) witii DNA; 
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GroupM: W6292 fonnulated wifh OO^IOOS/BAK, fomulated as 
described in Bxainple lld» stqira. 



The LT neutralizatioii assay was performed on all rabbit sere &om fhe 
5 day 70 bleed. The median titer + one standard deviation is shown for each 

groiqpinFig* 19. 



EXAMPLE 13 



10 Immumzation and Challenge of Rabbits Using Codon-Optimized B. anihracis 

DNA Vaccines 

Ten groiqps of rabbits (Qryctolagus cuniadus. New Zealand albino 
zabbit8» 2-5 kg each at onset of tteatmenl;, ten (10) animals per groi^ miless 
otherwise noted) were used in this experiment These included selected groins 
15 of ammals described in Bxaniple 12, as noted below. The various plasmid 

DNA formulations were adnunistered by a bilateral intramuscular injection 
into the quadriceps muscles on Days 0, 28, and 56 with a needle. The dose 
volume to be administered is 500 fd/muscle, 1 ml/animal The vaccination 
groups were as follows: 

20 GioiQ)l: VR6292 formulated with Vaxfectin™ at a 4:1 DNA: 

Vaxfbctm^ ratio (Group A from Example 12); 

GT0iq>2: VR6292fonnulatedwithVaxfectin™at a4:l DNA: 
Vaxfectin™ ratio (two injections only) (Group G from Bxjmple 12); 

Groups: VR6292formulatedwithDMEaE/DOPEata4:l 
25 DNAilipid ratio (Groq) C from Exanq)le 12); 

Groiq) 4: VR6292 (500 ^ig) + VR-62951 (500 ^g) fonnulated with 
Vaxfectin™ at a 4:1 DNA: Vaxfectin™ ratio (two injections only) 
(Groi^ J from Exanq)le 12); 

Groi5)5: VR6292 (500 ng) + VR-62952 (500 ^g) formulated with 
30 Vaxfectin™ at a 4: 1 DNA: Vaxfectin™ ratio (Groi^ B from Bxan^le 

12); 
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Qroupfi: VR-^2952fomTilatedwithVaxfectm^ 
Vaxfectin™ ratio (two aninuils) (Qrovp R from Eicaniple 12); 

Group?: W1012£iOTiimlfltedwi1iiVarf«^ 
Vax&ctin™ latio (ibur aniinals); 

5 Qroxtp 8: VR1012 fomwlated with DMRIE/DOPE at a 4:1 DNAOipid 

latio (two animals); 

Group 9: Ck>inmeicial anthrax vaccine AVA, 50 pi, deliveEred on 28 
and 56 by a single M injection (Groiq>lfiomB3imq>le 12); and 

Group 10: Twelve imvacdnated animals. 

10 

The labbits in Groups 1-^ roceived a 500 [ig intramuscular injection in 
each quadricep muscle (bilateral) for a total of 1 mg of plasmid DNA per 
rabbit per ixnmumzation. In groups 1> 3, S, 6, 7, 8, and 9, three injections of 
the formulated plasmid DNA took place on days 0, 28^ and 56. In ffovps 2 

IS and 4, two mjections of the fonnulated plasmid DNA took place on days 0 and 

28. In group 19 commercial anthrax vaccine AVA» 50 pl» was iigected 
intramuscularly on days 28 and 56. AH rabbits were prebled two days before 
tiie first immunization and bled again on days 14, 42, and 70. 

Over a four-day period on or around day 70 (indicated in Table 17 as 

20 "challenge days" C1-C4), the rabbits were challraged by aerosol 

administration of B. anthracis (Ames strain) spores by standard m^ods. See^ 
e,g., Henderson, DW J. Hygiene 50:53-6S (1952)). Challenge doses ranged 
fiom about 50 LD50 equivalents to about 250 LD50 equivalents as noted in 
Table 17 below. The animals w^ monitored for moAidity and mortality at 

25 regular mtervals out to days 19-22 (depending on the challenge day) 

following challenge. The results are shown in Table 17, and are summarized 
in Table 18. "NC" denotes ''not challenged." 
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TABLE17 













1.1 


C3 


123.5 


Y 


15 


NC 






1.3 


C4 


113.5 


V 

T 


1.4 


CI 


56.4 


Y 

I 


1.5 


C3 


92.7 


Y 


1.8 


C4 


66.3 


Y 


1.7 


CI 


103 


Y 


1.8 


C2 


127.3 


Y 


1.9 


NC 






1.10 


C2 


128.8 


Y 




«r>2i-:i;;;!lii;i;!i!iii(;ii!i;;;^^i 




2.1 


C1 


78 


Y 


2.2 


C4 


70.5 


Y 


2.3 


NC 






2.4 


C4 


52.1 


Y 


2.S 


CI 


252.1 


Y 


2.6 


C2 


119.1 


Y 


2.7 


C3 


SZA 


Y 


2.8 


NC 






2.9 


C2 


71.9 


Y 


2.10 


C3 


195.1 


Y 






3.1 


C2 


55.7 


Y 


3.2 


C3 


238.3 


Y 


3.3 


C4 


110 


Y 


3.4 


CI 


208.1 


Y 


3.5 


NC 






3.6 


CI 


142.9 


Y 


3.7 


C3 


169 


Y 


3.8 


NC 






3.9 


C4 


57.5 


Y 


3.10 


C2 


74.7 


Y 








4.1 


C3 


87.3 


Y 


4.2 


C4 


902 


Y 


4.3 


CI 


81.6 


Y 


4.4 


NC 






4.5 


C2 


100 


Y 


4.6 


C2 


72 


Y 


4.7 


C1 


76.1 


Y 


4.8 


C4 


92.8 


Y 


4.9 


NC 






4.10 


C3 


206 


Y 
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5.1 


NC 






52 


CI 


192.2 


Y 


5^ 


C2 


152.6 


Y 


5.4 


C4 


66.6 


Y 


5.5 


C3 


135.7 


Y 


5.6 


C2 


85.1 


Y 


5.7 


C4 


79 


Y 


5.8 


C1 


126.6 


Y 


5.9 


C3 


154.7 


Y 


6.10 


NC 








ii6-6^i;"-}^V:i-.n!?tiiti3';if-|ii 


. -.i-f. .I;:l;l'hl 


6.1 


C4 


117.7 


Y 


6.2 


C3 


241.4 


N(D4) 


6.3 


NC 






6.4 


CI 


107.3 


Y 


6.5 


C4 


58.7 


N(04) 


. 6.6, 


C3 


121 


Y 


6.7 


C3 


160.8 


Y 


6.8 


C2 


46.1 


N(D7) 


6.9 


CI 


195.2 


N(D6) 


6.10 


C2 


94.5 


Y 






7.1 


C3 


101.9 


N(D3) 


72 


C4 


144.1 


N(02) 


7.3 


NC 








CI 


1082 


N(D2) 










8.1 


C2 


63 


N(D3) 


8.2 


C4 


582 


N(D3) 








9.1 


C2 


113.4 


Y 




CI 


106.9 


Y 


9J3 


C3 


157.6 


Y 


9.4 


C4 


175.6 


Y 


y!il«i^fl:.itilllf!ilBiM;?i^iir 






fieyiiRiiiii^iiiiiiitiii 


10.1 


C4 


76.7 


N(D2) 


10.2 


C3 


207.6 


N(D3) 


10.3 


C2 


91.5 


N(D2) 


10.4 


C4 


176 


N(D2) 


10^ 


C2 


123 


N(D3) 


10.6 


C2 


95.4 


N(D3) 


10.7 


C4 


91.5 


N(D3) 


10.8 


C1 


1652 


N(D2) 


10.0 


CI 


57.3 


N(D3) 


10.10 


C3 


163.8 


N(D4) 


10.11 


C3 


1142 


N{D2) 


10.12 


CI 


02.3 


N(D2) 
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TABLB18 





':&ttViM|; . 


1 


8/8(100%) 


2 


8/8(10090 


3 


8/8 (1009i) 


4 


8/8 (100%) 


S 


8/8(100%) 


6 


5/9(56%) 


7 


0/2(0%) 


8 


Ofl 


9 


4/4(100%) 


10 


0/12(0%) 



5 . EXAMPLE 14 

TmniuTiization of Mice using Siogle Vial Fomuilatioiis 

Single vial foimulations were prepared by teconslituting bulk DMRIE 

10 and IX)PB lipids to fommulti-lameUar vesicles These vesicles were 

then fintber processed to produce small DMRIE and DOPE liposomes (SUV) 
and sterile filtered IbiDugh a 02pm membrane. The formulations were 
prepared asepticaUy at room temperature by adding sterile plasmid DNA and 
sterile DMSIE:DOPB SUV liposomes into separate feed lines and then 

15 combining into a third sterile vessel via in-line nubdqg. Moderate rates of 

addition and moderate in-vessel mixing were used to fonn a li|dd4)lasmid 
DNA contplex. Preparation of lipids and lq>id^lasmid DNA CQnq>lexes is 
described in Zel^hali et al. Gene Therapy 5: 1277-1282 (1998) ^ch is 
incorporated herein by reference in its entirety. The formulations described 

20 below contain final molar ratios of 4: 1 or 2: 1 plasmid DNA to DMRIR 

Eight g;roiq}8 of mice, containing 10 mice in each group, were injected 
bilaterally in the rectus femoris muscle with the various formulations 
described below. Each injection contained SO}xg of pmifled plasmid VKfi292 
(PA83Afurin) in a volume of 0.1ml. At 0» 2 and 4 weeks the groiq)s were 

25 hgected with die following formulations, all containmg 50)xg of VR6292 
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plasmid DNA (prepared as described in the plasoud DNA puiificatioxi section 
prior to Example 1). 

Groi^ A: Unextruded MLV, in a 4:1 molar ratio of plastnid DNA to 
DMRDB, in PBS 7.2). Hie fonnuIatiQn was fresbly prepared just 
prior to ixgectiorL 

Group B: Unextruded MLV, in a 4:1 molar ratio of plasmid DNA to 
DMRIB^ in 10% sucrose and IQmM sodium phosphate, pH 12. The 
fonnulation was freshly prepared just prior to injedian. 

Groi^ C: 0*2 ^m filter extruded (iSUV) liposomes, in a 4:1 molar ratio 
of plasmid DNA to DMRIE, in 10% sucrose and lOmM sodium 
phosphate (pH 1.7). The formulation was stored overnight at 2-8°C 
prior to moculation. 

Group D: SUV liposomes^ in a 4:1 mtio plasmid DNA to DMIUE, in 
10% sucrose and lOmM sodium phosphate, pH 7.2. The formulation 
was frozen prior to inoculation. 

Group E: SUV liposomes, in a 4:1 molar ratio of plasmid DNA to 
DMRIE, in 10% sucrose and lOmM sodium phosphate, pH 7.2- The 
formulation was lyopMUzed prior to moculatioiL 

GroiQ) F: Unextruded MLV, containing cholesterol in place of DOPE, 
in a 4:1 molar ratio of plasmid DNA to DMRIE, in 10% sucrose and 
lOmM sodium phosphate, pH 7.2. The fbrmulation was freshly 
prq>ared just prior to injection. 
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Oioup G: Unextruded MLV, in a 2:1 molar ratio of plasniid DNA to 
DMRIB, in PBS, pH 12. The fbtmulatLon was fieshly prepared just 
prior to injectiorL 

S Group H: SUV fiposomes, in a 2:1 molar ratio of plasmid DNA to 

DMRIB, in 10% sucrose and lOmM sodium phosphate, pH 12. The 
foxmulation was stored ovetni^ at 2-8°C prior to injection. 

Mice were bled for serum prior to each DNA immunization at week 0 
10 . (Prebleed), week 2 (Bleed 1), week 4 (Bleed 2) and four weeks post die last 
injection (Bleed 3). Anti-PA IgO antibody titen and neutralization of lellial 
toxin O^tK) titers were perfbnned as described in Em^ The antibody 
titos and neuttalization results for each bleed and every formulation tested are 
shown in Tables 19 and 20. 

15 

TABLE 19: Anti - PA IgG Titer 















Geometric Mean 


Prebleed 


80 


80 


80 


80 




Bleed 1 


10975 


4165 


7760 


4457 




Bleed 2 


305736 


66540 


108094 


62084 




Bleeds 


1616014 


37640S 


376405 


655627 




StdBev. 


Piebleed- 


0 


0 


0 


0 




Bleed 1 


11372 


7630 


23983 


12421 




Bleed 2 


215705 


58765 


84998 


51642 




Bleeds 


639310 


370406 


343674 


1200361 






ms.: 1 Itlil!!. 


!ii!il!lliiiK;.ift;i; 


1 i.i-rj vF' • • ■; 






Geometric Mean 


Prebleed 


80 


80 


80 


80 




Bleed 1 


7760 


5487 


9554 


4457 




Bleed 2 


81920 


71316 


327680 


76434 




Bleeds 


1310720 


266159 


1310987 


351199 




Std.Dev. 


Prebleed 


0 


0 


0 


0 




Bleed 1 


15017 


8172 


23498 


26219 




Bleed 2 


116014 


55555 


205073 


53970 




Bleeds 


0 


197337 


678738 


221840 



20 
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TABLB 20: Letx Neiitratizing Antibody Titos 





mmma. 






m 




Mean 


184 


226 


160 


149 


StiLDev. 


217 


372 


178 


390 












I.'- if; 

ill 




Mean 


92 


86 


211 


35 


StdDev 


111 


89 


212 


46 



5 

EXAMPLE IS 

Immunization of Non-Human Primates Using Codon-Optimized B, anikracis 
10 DNA Vaccines 

Three gcoupa of cynomologous macaques (M. fasiadarts\ containing 
three monkeys in each git>i9, were used in this expe^ The animals were 
immunized unilaterally, intramuscularly, in the deltoid muscle with a 

15 Biomjector device. Varying amounts of purified VR6292 (PA83Afurin) 

plasmid DNA (prepared as described in the plasmid DNA purification section 
prior to Example 1) formulated with Vaxfectin™, in a 4:1 molar ratio of 
plasmid DNA to lipid, was used in all inoculations in this study. All animals 
received injections at month 0, 1 month, and 2 months. Group 1 received 

20 20ng of plasmid DNA at each inoculation. Group 2 received lOOng of 

plasmid DNA at each inoculafioa Group 3 received 200)xg of plasmid DNA 
at each inoculation. 

The monkeys were bled fi)r serum prior to each DNA immunization at 
monOi 0 (Bleed 1), mcmtfa 1 (Bleed 2), month 2 (Bleed 3) and at four weeks 

25 after the last irg'ection (Ble^ 4). Anti-PA IgG antibody titers and 

neutralization of lethal toxin (Letx) titers were perfaimed as described in 
Exan^Ie 9. The antibody titers and neutralization results for each group of 
animals are shown in Tables 21, 22 and 23. 



wo 2004/024067 



FCnAIS2003/028199 



-111- 

2 out of 3 animals in Qroxsp 1 generated an anti-PA IgG titer. One of 
the animals generated a sizable titer (20,000) after three injections. This titer 
is comparable to flie titers of the animals in groiq>s receiving hi^er doses of 
plasmid DNA (Gn>iq)s 2 and 3). None of the animals in Qrovp 1 generated 
5 any measurable neutralization ^vity of Letx at the lowest dilutions tested 

(serwn diluted 1:20). 

The animals in Gioips 2 and 3 generated similar immune responses to 
the inoculations. All monkeys in both groi^ developed anti-PA IgO titers* 
Letx neutralization titers were generated in 2 out of 3 monkeys in bofli gioiq>s. 
10 The remaining animal in each groiq) had measuxable neutralization activity, 

but below the level needed to score atiter. 



TABLE 21: Group 1 - Anti-PA IgO and LefX Neutralizing Titers 













Aiiixnal# 




1001 


1002 


1003 




Bleedl 


80 


160 


640 




BkedS 


80 


640 


10240 




meeAA 




2560 


20480 
















iilili 








Bleed 3 


0 


0 


0 




Bleed 4 


0 


0 


0 













TABLE 22: Qmxsp 2 - Anti-PA IgO and LsOL Neuttalizing TiteiB 





:!ii^3iiiiW5A:iw5;;?^ 




:l;,;.-s;i!i!!:-;>f!{fi!;« 




m 


Aoiiiisl # 




2001 


2002 


2003 




Bleed 2 


640 


10240 


5120 




Bleeds 


10240 


20480 


40960 




Bleed 4 


40960 


40960 


81920 












'>!lil;r|l»!|i.,:;^li 
ii-'iii'ukrf?! 






iiiS! W 


Vj||fii^:i; 






Bleed 3 


0 


**« 






Bleed 4 




40 


80 













denotes a detectable low level of neuttalizBtiaii acdvxty 
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TABLE 23: Qtoxsp 3 - Anti-PA IgG and LeiX Neutralizing Titers 











"• r. 1. .-^.''.j^'ii! 


Animal ^ 




3001 


3002 


3003 




Bleed! 


640 


320 


640 




Bleeds 


5120 


20480 


10240 




Bleed4 
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20480 


81920 
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Bleed 3 


0 


40 






Bleed 4 




160 


80 













deziatBa a detectabk low levd of neixtn^^ 



5 EXAMPLE 16 

lomuimzation CbBlIcsnge of RabUts Using Codon-Optibiiized B. anthrads DNA 

Vacdnes 

10 a) Long-Tenn Immune Response in DNA Tmninnized Rabbits 

10 rabbits immunized, as described in Example 12, Group D 
(Immunized three times with VR6292X were followed long-term fi>r anti-PA 
antibody titer, LetX neutralization titer and protective inrnmne response to an 

15 anthrax ^re challenge. Anti-PA IgG antibody titers and LetX neutralization 

titers were performed as described in Example 9. The results of the titers and 
neutralization assays are shown in Table 24, Rabbits wece bled twelve times 
on tlie weeiks indicated in Table 24. 

On week 39 of the experiment, rabbits were challenged by aerosol 

20 admiiiistFBtion of B. anthrads (Ames strain) spores by standard methods as 

described in Exanqile 13. All rabbits survived Control animals tiiat were not 
vaccinated did not survive challenge. 



25 
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TABUB24 









Wedc 
ftKMt fiiBt ngectioa.) 


GeomctxicMeaiL 


StzLDev. 


2 


20480 


22744 


6 


2622775 


1635799 


10 


12909485 


6079129 


14 


6456057 


8244196 


Ig 


4565122 


7496131 


22 


2810448 


2931335 


26 


1311120 


1508050 


30 


1311120 


1508050 


34 


1405367 


888676 


39 


1U04/44 




40 


1505928 


1809833 




2129704 


1865167 








We* • 
(post iBist injection) 


OeonBldc Mean 


StdDev. 


6 


1576 


843 


10 


4457 


2956 


14 


2560 


1602 


18 


1372 


1441 


22 


1194 


1455 


26 


970 


607 


30 


905 


641 


35 


905 


641 


40 


844 


843 


41 


1040 


911 


43 


1194 


955 



5 b) Rabbit lauaunization Dosing with Intended Human Vaccine Product 

Sixty New Zealand White rabbits (30 males and 30 females), 
qiproximately 10-12 wedka old, were used for this study. Ten animals per sex 
were injected wifli the formulations described below. The plasmids were 
10 finmnlated vnth DMRIE/DOPB in a 4:1 DNA to Upid mass tatio in PBS, as 

described in Example Sb. 

G1019I: LQmlofPBS 

Groiv 2: 0*lmg of plasmid VRr6292 and O.lmg of 
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plasmidVR 62952. 

Group 3: LQmg of pksmids VR-6292 and l.Qmg VR- 
6295^ 

S All animals in the study recdved unilateral intramuscular injections into the 

vastus lateralus muscle at 0» 2, 4 and 8 weeks. 

Serum samples were taken fiom all study animals once during tbe pre- 
treatmentperiodandoncedQiingweek8 2,4, 6^8, 10andl2. Anti-PAandLF 
antibody titers and Letx nwtralizing antibodsr titers were evaluated using 

10 sorom sainples taken prior to immunization and at 8 weeks prior to the fourth 

DNA immunization. All immunological assays were performed as described 
in Bxanq)le 9. Anti-PF and LF antibody titers and Letx neutralizing antibody 
resulte for the bleeds taken at week 8 are shown m Tables 25 and 26. 

15 TABLE 25: And-PA and Anti-LF Antibody Titers (Geometric Mean) 



1 ■ 
















Asti-FA 


163840 


514211 


Anti-LF 


1^840 


678540 








Anti-PA 


230686 


595754 


And-IF 


386254 


730464 



TABLE 26: Letx NeutraHzatioa Titeni (Geometnc Mean) 

20 
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StdDev 


510 
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c) Post Challenge Inimiaie Rjesponses in Aerosolized Spore Challenged 
Rabbits. 

Six groi^ of rabbits, witti 10 individuals in each groiq), were 

5 immunized as described for Qtoups A-C, G, H and K in Example 12. 39 

weeks after the last immunization, these rabbits were chaUeoged widi anthrax 
aerosolized spores^ as described in Bxample 13. Control animals that had not 
been imniunized were also chaUenged as described in Exanq>le 13. No control 
animal survived the challoige, 

10 At one day prior to challenge, and at 7 and 21 days post challenge 

animals were bled for serum. And-PA and LF IgG antibody and LetX 
neutralizing titers were performed as described in £xanq>le 9. It should be 
noted that except as desoibed below, immunized animals had developed 
protective immunity since Ihey survived challenge. 

IS Hie imnnme responses post challenge cotild be divided into two 

groups: rabbits that showed no increase in immune response after challenge 
(lack of boosting) and rabbits that were boosted in their response to PA and/or 
' LF after spore challenge. 

All rabbits immmiized as decribed for Groups A-C, in Example 12 

20 (immunized with VR6292 or VR6292+VR62952, three times), demonstra±ed a 

lack of boosting. Two mbbits immunized as described for Gioiq> G, in 
Example 12 (immunized with VR6292 twice), had the lowest anti-PA titers 
pr&challenge and demonstrated a small post-challenge boost in anti-PA titer 
and the genemtion of an anti-LF response. 

25 Several rabbits immunized as described fiir Gmxp K, in Exanq)le 12 

(immunized wilh VR629S2 (LP[I-nr|)), did not survive anthrax spore 
challenge. The five surviving rabbits all had significant anti-PA titers post 
challenge. Additionally rabbits immmiized as described for Group H m 
Exanqple 12 (immimized twice wilb the commercial anthrax vaccine AVA), 

30 had no measurable anti-Ifieqpcmsepre-challGnge. After challenge all rabbits 
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sbowed a boosted anti-PA titer and the generation of a ^rong anti-LF 
response. 

In summary, all rabbits immunized two or fliree times with plasmids 
encoding PA or PA+LF generated strong immune r^onses and were able to 
S survive anliizax spore challenge. Ahnost all of these rabbits showed a lack of 

response boosting post-challenge, which is consistent with sterilizing 
immunity* In contrast, rabbits inmunized twice with SOpl of AVAexhib^ 
strong anti-LF response and a boosted anti-F A titer. 

10 EXAMPLE 17 

Mucosal Vaccination and Electrically Assisted Plasmid Delivery 

a) Mucosal DNA Vaccination 

15 Plasmid constructs conq>rising codon-optimized and non-codon- 

optimized coding regions encoding LP, PA or various fiagmeots, variants or 
dmvatives, as described herein^ are delivered to BALB/c mice at 0, 2 and 4 
weeks via ijoi.^ intranasal (in.), intravenous (Lv.)« inlravaginal (Lvag.), 
intrarectal (ix.) or oral routes.. The DNA is delivered unformulated or 

20 formulated with the cationic lipids DMKDB/DOPE (DD), DMRIE/Oiolesteiol 

or Vaxfectin™. Serum IgG titers agamst the various LF and PA antigens are 
m^sured as described in Kcample 9, as well as Letx neutralization titers. 

b) Electrically-assisted plasmid delivexy 

25 bi gene delivery may be erihanced through flie apphcstixm of brief 

electrical pulses to injected tissues, a procedure referred to herein as 
electdcaljly-assisted plasmid delivery. See, eg:, Aihara, H. & Nfiyazaki, J. 
Nat. Biotechnol 16:%61-70 (1998); Mir, LM. et al, Proc. Natl Acad. Sci. 
USA 96:^262-^7 (1999); Hartikka, J. et al, MoL Tker, 4A07AS (2001); and 

30 Mir, LM. et al.; Rizzuto, G. et al. Hum Gene Ther ii;1891-900 (2000); 
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Widera, G. et al, 1 oflmmmo. 164: 4635-4640 (2000). The use of electrical 
pulses for cell electropemxeabilization has been used to introduce foreigct 
DKA into prokaryotic and eokaryotic cells in ^ntro. Cell pCTaeabilizalion can 
also be achieved locally, in vivoy usmg electrodes and optimal electrical 
parameters that are con^ialible with cell survival. 

The electioporBtion procedure can be perfbimed with various 
electropooration devices. These devices include external plate type electro^ 
or invasive needle/iod electrodes and can possess two electrodes or multiple 
electrodes placed in an array. Distances between the plate or needle electrodes 
can vary depending vpan the number of electrodeSj size of target area and 
treatment subject. 

The TriGxid needle anay, used in exaii[q)les described herein, is a three 
electrode array comprising three elongate electrodes in the approximate shape 
of a ^ometxic triangle. Needle arrays may mclude single, double, tbree, four, 
five, fflx or more needles arranged in various array formations. The electrodes 
are connected through conductive cables to a high voltage switdiing device 
that is connected to apower supply. 

The electrode array is placed into the muscle tissue, around the site of 
nucleic acid injection, to a depth of approximately 3 nun to 3 cm. The depth 
of insertion varies depending iq>on the target tissue and size of patient 
receiving electroporation. After injection of foreign nucleic acid, suck as 
plasmid DNA, and a period of time sufBcient for distribution of the nucleic 
acid, square wave electrical pulses are applied to ttie tissue. The anq)litude of 
each pulse ranges fiom about 100 volts to about 1500 volts, e.g., about 100 
volts, about 200 volts» about 300 volts, about 400 volts, about 500 volts, about 
600 volts, about 700 volts, about 800 volts, about 900 volts, about 1000 volts, 
about 1100 volts, about 1200 volts, about 1300 volts, about 1400 voUs, or 
about 1500 volts or about l-1.5kV/cm, based on fh& spacing between 
electrodes. Each pulse has a duration of about Ifja to about lOOO^s, eg., 
about l^s, about lO^s, about 50)is, about lOO^is, about 200ps, about 300ps, 
about 400p^ about SOO^s, about 600^s, about 700^s, about SOO^s, about 
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900|xs, or about lOOO^s, and a poise fiequency on the order of about 1-10 Hz. 
Tlte polarity of the pulses may be reversed during the electroporation 
procedure by switching the connectors to the pulse generator* Pulses are 
repeated multiple times. The electroporation parameters (e,g. voltage 
an^litude, duration of pulse, number of pulses, depth of electrode insertion 
and fiequency) will vary based on target tissue type, number of electrodes 
used and distance of electrode spadn^ as would be understood by one of 
ordinary skill in the art 

Tmmediately after completion of the pulse r^unen, subjects receiving 
electroporation can be opttonally treated with membrane stabilizing agents to 
inolong cell membrane permeability as a result of the electropotatioiL 
Bxamples of membrane stabilizing agents include, but are not limited to, 
steroids (e.g. dexameOiasone, methylprednisone and progesterone), 
angiotensin n and vitamin B. A single dose of dexamediasone, ^[iproximately 
0.1 mg per kilogram of body weight, should be sufficient to achieve a 
beneficial affect 

EAFD techniques such as electroporation can also be used for plasmids 
contained in liposome formulations. The liposome - plasmid suspension is 
administered to the animal or patient and the site of injection is treated with a 
safe but effective electrical field generated, for exan^le, by a TriGrid needle 
array. The electroporation may aid in plasmid delivery to the cell by 
destabilizing the liposome bUayer so that membrane fusion between the 
liposome and the target cellular structure occurs. Blectroporation may also aid 
in plasmid defiveiy to the cell by triggering the release of the plasmid, inhi£^ 
concentrations, fiom the liposome at the surface of the target cell so that the 
plasmid is driven across the cell membrane by a concentration gradient via the 
pores created in tiie cell membrane as a result of the electn^oratioiL 

Female BALB/c mice aged 8-10 weeks are anesthetized with inhalant 
isoflurane and maintained undo: anesthesia for flie duration of the 
electroporation procedure. The Ipgs are shaved prior to treatment Plasmid 
constructs conqmsing codourOptimized and non-codon-optimized coding 
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regions wMch encoAe LP, PA or various ftagments^ variants or derivatives, as 
described herein, are administer^ to BALB/c mice (n = 10) via unilateral 
injection in the quadriceps, with SO \ig total of a plasmid DNA per mouse, 
using an 0.3 cc insulin syringe and a 26 gauge, 172 length needle fitt^ with a 
plastic collar to regulate injection depth. Approximately one minute after 
injection, electrodes are appHed. Modified calqjer electrodes are used to apply 
the electrical pulse. See HattUdca LetoL Mol Ther i5ft407-41S (2001). The 
calipo: electrode plates are coated with conductivity gel and applied to the 
aides of tibie injected muscle before closing to a gap of 3 nun for administration 
of pulses. EAPD is qyplied using a square pulse type at 1-^10 llz wUfa a fidd 
strength of 100-500 V/cm, 1-10 pulses, of 10-100 ms each. 

Mice are vaccinated ± EAPD at 0, 2 and 4 wedcs. As en^mts, serum 
Ig^ titers against tibie various IJP and PA antigCTs are measure 
Exanq)le 9, as well as Lebc neutralization titers. 

Rabbits (n = 3) are given bilateral injections in the quadriceps muscle 
with plasmid constructs conqnising codon-optimized and non-codon- 
opttmized coding regions which encode LP, PA or various fragments, variants 
or derivatives, as described herein. The implantation area is shavpd and the 
TriGrid electrode array is inq)lanted into the target region of the muscle. 3.0 
mg of plasmid DNA is administered per dose through the injection port of the 
electrode array. An injection collet is used to control tibte depth of injection. 
Electroporation begins i^yproximately one minute z&ear injection of the 
plasmid DNA is complete. Electroporation is administered with a TriGrid 
needle array, with electrodes evenly spaced 7mm <^art, using an Ichor TGP-2 
pulse generator. The array is inserted into the targ^ muscle to a dqptti of 
about 1 to 2 cm. 4-8 pulses are adnunistered. Each pulse has a duration of 
about 50-100 ^s, an anq)litude of about l-1.2kV/cm and a pulse frequency of 
1 Hz. The ingection and electrcyporation may be repeated. 

Sera are collected from vaccinated rabbits at various time point As 
enc^oints, serum titers against the various LP and PA antigens are 
measured as described in Exanq)le 9, as well as Letx neutralization titers. 
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To test the effect of electroporatioii on therapeatic piotem expression 
in non-homan primates, male or finale cynomonologoiis macqu^ are given 
either 2 or 6 LnL injections of plasmid constructs conqirising codon-optimized 
and non-codon-optimized coding regions whidi encode LF, PA or various 

5 fiBgments, variants or derivatives, as described herein, (0.1 to 10 mg DNA 

total per animal). Target muscle groups include, but are not linaited to, 
bilatoal rectus featmoris, ciamal tibialis, biceps^ gpstrocenemius or deltoid 
muscles. The target area is diaved and a needle array, coixq>ru^^ 
and 10 electrodes, spaced between 0.S-1.S cm ^art, is implanted into &e 

10 target mnsde. Once iiijections are conqplrte, a sequence of brief electrical 

pulses arc lulled to the electrodes in^lanted in the target muscle usLog an 
Ichor TGP->2 puke generator. Ihe pulses have an amplitude of qyproximatsely 
120 - 200V. The pulse sequmce is completed witiiino^ Duringftis 
time, the target muscle may mate brief contractions or twitches. The injection 

15 and electcoporation may be repeated. 

Sera are collected from vaccinated monkeys at various time points. As 
endpoints, serum IgG titers against the various LF and PA antigens are 
measured as described in Example 9, as well as Letx neutralization titers. 

20 

The present invention is not to be limited in scope by the specific 
embodiments described which are intended as single illustrations of individual 
aspects of the invention, and any con^ositions or methods which are 

25 fimctionaUy equivalent are within the sc(^ Indeed, various 

modifications of flie invention in addition to those shown and described herein 
will become qiparent to those skilled in tibe art from the foregoing description 
and accompanying drawings. Such modifications are intended to M within 
the scope of the qypended claims. 

30 All publications and patent plications mentioned in this specification 

are herein incorporated by refermce to tiie same extrat as if each individual 
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publication or patent appUcation was specifically and individually indicated to 
be incorporated by refearence. 
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WHATESCXAIMEDIS: 

1. An isolated polynucleotide comprising a nucleic acid fragment 
wfaich encodes at least SO contiguous amino adds of SEQ ID N0:4, wfa^ean 
said nucleic add fi:agment is a fragment of a codon-optiboiized coding region 
5 for die polypeptide of SEQ ID N0:4; 

wherein about 11 ofthe 24 phenylalanine codons in said coding region 
are TXT and about 13 of said phenylalanine codons are TTC; 

wheiein about 5 of Ifae 62 leucine codons in said coding region are 
TTA, about 8 of said leucine codons are TTG» about 8 of said leucine codons 
10 are CTT, about 12 of said leucine codcms are CTC, about 4 of said leucine 

codons are CTA, and d>out 25 of said leucine codons are CT 

wherein about 20 ofthe 57 isoleucine codons in said coding region axe 
ATT, about 28 of said isoleucine codons are ATQ and about 9 of said 
isoleucine codons are ATA; 
15 wherein die 10 methionine codons in said coding region are ATG; 

wherein about 8 of the 43 valine codons in said coding region are GTT, 
about 10 of said valine codons are GTG, about 5 of said valine codons are 
GTA, and about 20 of said valine codom are GTG; 

wh^ein about 1 3 of the 72 serine codons in said coding region are 
20 TCT, about 16 of said serine codons are TCC, about 11 of said serine codons 

ate TCA^ about 4 of said srarine codons are TCG, about 1 1 of said serine 
codons are AGT, and about 17 of said serine codons are AGO; 
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wheieiii about 8 of the 29 proline codons in said coding region are 
CCT, about 1 0 of said proline codons are CCC, about 8 of said proline codons 
are CCA, and about 3 of said proline codons are CCG; 

wherein about 14 of the 38 threonine codons in said coding region are 
ACT» about 21 of said threonine codons are ACC, about 16 of said threonine 
codons are ACA» and about 7 of said direonine codons are ACG; 

v^ecem about 11 of the 41 alanine codons in said coding region are 
GGT, about 17 of said alanme codons are GCC» about 9 of said alanine codons 
are GCA, and about 4 of said alanine codons are GCG; 

wherein about 12 of the 28 tyzosme codons in said coding region are 
TAT and about 16 of said tyrosme codons are TAG; 

wherdn about 4 of the 10 histidine codons in said coding region are 
CAT and about 6 of said histidine codons are CAC; 

wha:dn about 8 of the 31 ghitanune codons in said coding region are 
CAA and about 23 of said ghitamine codons ore CAG; 

wherdn about 32 of the 69 asparagine codons in said coding region are 
AAT and about 37 of said asparagjne codons are AAC; 

wherem about 25 of Ihe 60 lysme codons in said coding region are 
AAA and about 35 of said lysine codons are AAG; 

wherein about 22 of the 47 aspartic acid codons in said coding region 
are GAT and about 25 of said aq)artic acid codons are GAC; 

wherein about 21 of the 51 ghitannc add codons m said coding regi 
are GAA and about 30 of said glutamic add codons are GAG; 
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wherein the 7 tryptophan codons in said coding region are TGG; 

vtdierein about 2 of &6 29 argjnine codons in said coding region are 
CGT, about 6 of said arginine codons are CKSC, about 3 of sai 
codons are CGA^ about 6 of said arginine codons are CGG. about 6 of said 
5 arginine codcms are AGA, and about 6 of said arj^nine codons are AGG; and 

wherein about 6 of the 36 glycine codons in said coding region are 
GGT, about 12 of said glycine codons are GGG, about 9 of said glycine 
codons are GG A, and about 9 of said glydne codons are GGG. 

10 2. The polynucleotide of clahn 1, vidierein said nucleic acid 

firagmcnt encodes at least 100 contiguous amino acids of SEQ ID N0:4. 

3. The polynucleotide of claim 2» wherein said nucleic add 
fragmoit encodes amino adds 199 to 764 of SBQ ID N0:4. 

15 

4. The polynucleotide of claim 3, wherein said nucldc acid 
fiagment comprises nucleotides 82 to 1782 of SBQ ID NO:l. 

5. The polynucleotide of any one of claims l-4» wberean said 
20 micldc add fiagment is lighted to a heterolo^us nucldc acid. 
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6. The polynucleotide of claim S, whom said heterologous 
nDdeic acid encodes a heteiobgous polypeptide fiised to the polypeptide 
encoded by said nucleic acid firagooirat 

5 7, The polynucleotide of claim 6, wherein said heterologous 

polypqitide is a secretory signal pq)tide. 

8. The polynucleotide of claim 7, ^erdn said signal peptide is a 
human tissue plasminogen activator (hTPA) signal peptide. 

10 

9. The polynucleotide of claim 8, comprising nucleotides 13-1782 
ofSEQIDNOrL 

10. The polynucleotide of claim 9, conq>iising SEQ ID NO:L 

15 

1 1 . The polymicleotide of claim 1, wherein said nucleic add 
fiagment encodes anaino acids 30 to 764 of SEQ ID N0:4. 

12. Hie polynucleotide of claim 1 1, wherdn said nucldc acid 
20 jBragmeot is ligated to a heterologous nucleic add 
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13- The polynucleotide of claim 12, whereiii said heterologous 
nucleic acid encodes a heterologous polypeptide fiised to flie polypq>tide 
encoded by said nucleic acid fragment 

5 14. The polynucleotide of claim 13, wfaeiem said heterologous 

polypeptide is a secretoiy signal pq>tide. 

15. The polynucleotide of claim 14, wherein said signal peptide is a 
human tissue plasminogen activator (hXPA) signal peptide. 

10 

16. The polynucleotide of claim 1 1, wherein said nucleic acid 
fragment conqprises nucleotides 88 to 2292 of SBQ n> NO:23. 

17. The polynucleotide of any one of claims 1-16, which is DNA, 
IS and iT^erein said nucleic acid fragment is qpmbly associated with a promoter. 

18. The polynucleotide of any one of claims 1-16, which is SNA. 

19. The polynucleotide of claim 1 8, which is messenger RNA 
20 (mRNA). 



20. A vector c om pri sin g the polynucleotide of any one of claims 1- 

17. 
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2 1 . The vector of claim 20, which is a plasznid. 

22. A composition compiismg the polyancleotide of any one of 
claims 1-19» and a caniear. 

23. The con^sition of claim 21, fiirther compiismg a component 
selected fiom the gtoi^ consisting of an a(^uvant, and a transf ection 
fiu^ilitating conq>omid. 

24. The composition of claim 23, wherein said adjuvant is selected 
from the group cdnsistiiig of: 

(±).N-(3-aminopiopyl)-N;N-KiimethyI-2,3-bis(5^ 
l-propanamnrittm bromide (6AP-DM0SIE) and a n^itral lipid; 
acytokm^ 

mono-pho&phoryl lipid A and trehalosedicorynomycolateAF (MPL + 

a solubilized mono-phosphoiyl Hpid A fonnulation; and 
CRL1005/BAK, 



25. The composition of claim 24, herein said adjuvant 
comprisesCi^N-^S-^inc^pyl^NjN-^^^ 
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tetradecaaeyloxy^l-propanammmn bionude (GAP-DMORIEX and Avfa^ein 

said neutral lipid is selected &om the group oansistnig of: 

l^^lioleoji-OT-g^ycero-S-pho^hoethanolamine (DOPE), 
i;ZHiiphytanoyl-^-gLycero-3-phosphoefha^ (DP>PE), and 
l^-<]imytisto)4-g^yoer-3-phosphoetli^ 09MPE). 

26. The composition of claim 25, wherein said neutral lipid is 
DP^B. 

27. The composition of claim 73, comprising the transfection 
£ici]itatiQg compound (±)-N-(2-hydzpxyethyl)-N,N^Kiimethyl"2,3" 
bi8(tdradec^oxy>l-prppanamimum bromide) (DMRSE). 

28. A method to treat or pieveixt aiithrax infection in a vertebrate 
comprising: administeting to a vertebrate in need thereof flie composition of 
any one of claims 22-27. 

29. An isolated polynucleotide comprising a nucleic add fragment 
wbidi encodes a polypeptide at least 90% identical to amino adds 1 99 to 764 
of SEQ ID N0:4, wherein said nucldc add fragment is a variant fragment of 
an optimized coding region for the polypeptide of SEQ ID NO:4; 

iT^erein about 11 offhe 24 phen^alaninecodons in said coding region 
are TTT and about 13 of said phenylalanine codons are TTC; 



wo 2004/024067 



FCTAJS2003/028199 



-129- 

whexein about S of file 62 leucine codons in said coding region are 
TTA, about 8 of said leucine codons are TTG, about 8 of said leucine codons 
are CTT, about 12 of said Iracine codons are CTC, about4 of saidlracine 
codons are CTA» and about 25 of said leucine codons are CTG; 
S wherdn about 20 of the 57 isoleucine codons in said coding region are 

ATT, about 28 of said isoleucine codons are ATC, and about 9 of said 
isoleucine codons are ATA; 

wherein the 10 methionine codons in said coding region are ATG; 

^is^ierein about 8 of the 43 valine codons in said coding region are OTT, 
10 about 10 of said valine codons are GTG, about 5 of said valine codons are 

GTA, and about 20 of said valine codons are OTG; 

wherein about 1 3 of the 72 serine codons in said coding regiori are 
TCT, about 16 of said serine codons are TCC» about 1 1 of said serine codons 
are TCA, about 4 of said serine codons are TCG, about 1 1 of said smne 
15 codons are AGT» and about 17 of said serine codons are AGC; 

wherem about 8 of the 29 proline codons in said coding region are 
OCT, about 10 of said proline codons are CCC, about 8 of said proline codons 
are CCA, and about 3 of said proline codons are CCG^ 

wherein about 14 ofthe 58 Ihreomne codons in said coding region are 
20 ACT, about 21 of said threonine codons are ACC, about 16 of said threonine 

codons are ACA, and about 7 of said threonine codons are AGO; 
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wheiein about 1 1 of the 41 akoine codons in said coding region are 
GGT, about 17 of said alanine codons are GCQ about 9 of said alanine codons 
ateGCA, and about4 of said alanine codons are OCO; 

vdierein about 12 of the 28 tyrosine codons in said coding region are 
TAT and about 16 of said tyrosine codons are TAG; 

herein about 4 of the 10 histidine codons in said coding region are 
CAT and about 6 of said histidine codons are CAC; 

herein about 8 of Ifae 31 glutamine codons in said coding region are 
CAA and about 23 of said glutanune codons are CAG; 

herein about 32 of the 69 asparagine codons in said coding region are 
AAT and about 37 of said asparagine codons are AAC; 

wheteiii about 25 of the 60 lyBine codons in said coding region are 
AAA and about 35 of said lysine codons are AAG; 

wherdn about 22 of the 47 aspartic acid codons in said coding region 
are GAT and about 25 of said aspartic acid codons are GAC; 

wherein about 21 of the 5 1 glutamic acid codons in said coding region 
are GAA and about 30 of said ghtfamic add codons are GAG; 

\^erein the 7 tryptophan codons in said coding region are TG&, 

wherein about 2 of the 29 arginine codons in said coding region are 
CGT, about 6 of said argbcdne codons are CGC, about 3 of said arginine 
codons are CGA, about 6 of said arginine codons are CGG, about 6 of said 
arginine codons are AGA, and about 6 of said arginine codons are AGG; and 



wo 2004/024067 



PCT/US2003/028199 



-131- 

wherein about 6 of fiie 36 glycme codons in said coding region aie 
GGT, about 12 of said glycine codons are GGC, about 9 of said glycine 
oodons are GGA, and about 9 of said glycine codons are GGG. 

30. The polynucleotide of claim 2% wbsa:wi said nucleic add 
fragment encodes a polypeptide at least 95% identical to amino acids 199 to 
764ofSEQroNO:4. 

31. The polynucleotide of claim 29, wherein the codons in said 
nucleic acid fragment conesponding to axnino acids 342 and 343 of SEQ ID 
N0:4 are deleted. 

32. The polynucleotide of claim 31, wh^ein said nucldc add 
fragmaat encodes amino adds 24 to SS7 of SEQ ID N0:6. 

33. The polynudeotide of claim 32, which comprises nucleotides 
82tol773ofSEQIDNO:5. 

34. The polynucleotide of claim 29, wherein the aspaiagine cc^ons 
in said nucldc add fragment corr^ponding to amino adds 275, 321, 357, 417, 
505, 538, 599, 650, 693, and 738 of SEQ ID N0:4 are delded and each of said 
asparagine codons is replaced with a codon which codes for an amino add 
other than asparagine. 
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35. The polyaodeotide of claim 34, wherein ttie asparagjne codons 
in said nucleic add fiagment cozreEfpom&igto amino acids 275, 321, 357, 417, 
505, 538, 599, 650, 693, and 738 of SEQ ID N0:4 are deleted and each of said 
5 aspaiagine codons replaced with a codon whidi codes for ghitamine. 

36* The polynucleotide of claim 35, "herein said nucleic add 
fiagmCTt encodes amino adds 24 to 589 of SEQ ID N0:18. 

10 37. The polynudeotide of claim 36, which conqnises nucleotides 

82 to 1779 of SEQ ID N0:17. 

38* The polynucleotide of claim 31, wherein ike asparagme codons 
in sdd nncldc add fiagment corresponding to amino adds 275, 321, 357, 417, 
15 505, 538, 599, 650, 693, and 738 of SEQ ID NO:4 are deleted and eadi of said 

asparagine codons is replaced with a codon which codes for an amino acid 
other than asparagine. 

39. The polynucleotide of claim 38, wherein the asparagine codons 
20 in said tmcldc add fragment corresponding to amino adds 39, 153, 275, 321, 

357, 417, 505, 538, 599, 650, 693, and 738 of SEQ ID N0:4 are deleted and 
eadi of said asparagine codons Tq>laced with a codon which codes for 
glutamine. 
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40. The polynucleotide of my one of clauns 29-39, wherdn said 
nucleic add fiagmeat is figated to a heterologous nucleic add. 

41. The polynucleotide of claim 40, iwherdn said heteiolo^us 
nucldc add encodes a heterologous poIypq)tide fused to the polypeptide 
encoded by said nucleic add fragment. 

42. The polynucleotide of claim 41, wherdn said hete3x>logous 
polypeptide is a secretory signal peptide. 

43. The polynudeotide of claim 42» iivherdn said signal peptide is a 
human tissue plasminogen actii^tor (hTPA) signal p^ttdc. 

44. The polynucleotide of si^ one of daims 29-43, ^^vfaich is DNA, 
and wherein said nucldc acid fragment is op^rably associated with a promoter. 

45. The polynucleotide of any one of clainM 29-43, whidi is RNA. 

46. The polynudeotide of claim 45, i?vhidi is messen^ RNA 
(mRNA). 
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47. A vector conqmsing the polynucleotide of any one of claims 

29-44. 

48. The vector of clahn 47, which is a plasmid 

49. A composition comprising flie polynucleotide of any one of 
claims 29-46, and a canier. 

50. The conoposition of claim 49, further comprisii^ a component 
selected fiom the groi^ consisting of an adjuvant, and a transfection 
facilitating compound. 

5 1 . The CQnqx)sition of claim SO, T^Aierein said adjuvant is selected 
fix>m the group consisting of: 

(db)4^^3-aminopit>pyl)-N,N-dimethyl-23-bis(^ 
l-propanaminium bxomide (GAP-DMORIB) and a neutral lipid; 
a cytokine; 

mono-phosphoryl lipid A and trehalosedicorynomycolateAF (MPL + 

TDM); 

a sohibilized mono^phosphoi^ lipid A fonnulatioz]^ and 
CRLIOOS/BAEL 
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52. Hie coiqposition of claim 5 1» \^erem said adjuvant 
caii5d5es(±)-N<3-ammopK>pyl^^ 

tetFadec6neyloxy)-l-piDpanam]mum biomide (GAP-DMORIE), and wherein 

said neutral lipid is selected fiom the gcoixp consisting of: 

l^-4ioleoyl-^-glycero-3-phosplK>elhano ^OPB), 
l»2-diphytano}d^n-glycero-3-phoq)hoefhanolamin6 (DPyPE), and 
l^--dimyristoyl-g>ycer-3i)hosphoertiaTK)1am^ (DMPB). 

53. The coDDposition of claim 52, wherein said nratral lipid is 

54. The composition of claim SO, comprising the ttansfection 
fecilitating compound (±)-N-(2-hydroxyBttiyl>N^-dimediyl-2,3- 
bis(tetradeo^dxy)"l"pn)pananiinium bromide) (DMRIB). 

55. A method to tieat or prevent anthrax infection in a vertebrate 
comprising: adnodnistering to a vertebrate in need thereof the conqpodtion of 
any one of claims 49-54. 

56. An isolated polynucleotide con^risiag a nucleic acid fragment 
which encodes a polypeptide at least 90% identical to anoino acids 30 to 764 of 
SEQ ID N0:4, wherem said nucleic acid fragment is a variant fragment of an 
optimized coding region for the polypeptide of SEQ ID N0:4; 
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Mdiea:ein about 11 of the 24 phenylalanine codons in said coding region 
are TTT and about 13 of said phenylalanine codons are TTC; 

ivherein about 5 of the 62 leucine codons in said coding region are 
TTA» about 8 of said leucine codons are TTG, about 8 of said leucine codons 
S are CTT, about 12 of said leucine codons are CTQ about 4 of said leucine 

codons are CTA, and about 25 of said leucine codons are CTG; 

herein about 20 of fhe 57 isoleudne codons in said coding region are 
ATT, about 28 of ^d isoleudne codons are ATC, aud about 9 of said 
isol€ucine codons arc ATA; 
10 wherein the 10 methionine codons m said coding region are ATG; 

wherein about 8 ofUxe 43 valine codons in said coding region are GTT, 
about 10 of said valine codoDS m GTG, about 5 of said valine codons are 
GTA, and about 20 of said valine codons are GTG; 

indxerein about 13 ofthe 72 serine codons in said coding region are 
15 TCT, about 16 of said serine codons are TCC, about 11 of said serine codons 

are TCA, about 4 of said serine codons are TCG, about U of said serine 
codons are AGT, and about 17 of said serine codons are AGC; 

wherein about 8 of tiie 29 proline codons in said coding region are 
CCT, about 10 of said proline codons are CCC^ about 8 of said proline codons 
20 are CCA, and about 3 of said proline codons are CCG; 

wherem about 14 ofthe 58 threonine codons in said coding region are 
ACT» about 21 of said threonine codons are ACC, about 16 of said threonine 
codons are ACA, and about 7 of said threonine codons are ACG; 
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wherem about 11 ofthe41 alaxiinecodons in said codixig region are 
GGT» about 1 7 of said alanine codons are GCX:» about 9 of said alanine codons 
are GCA, and about 4 of said alanine codcms axe GCG; 

wherein about 12 of the 28 tyrosine codons in said coding region are 
S TAT and about 16 of said tyrosine codons are TAG; 

wherein about 4 of the 10 histidiae codons in said coding region are 
CAT and about 6 of said histidme codons are CAC; 

^ecein about 8 of the 3 1 glutamiae codons in said coding region are 
CAA and about 23 of said g^utamine codons are CAG; 
10 wherein about 32 of the 69 asparag^e codons in said coding region are 

AAT and about 37 of said asparagine codons are AAC; 

herein about 25 of the 60 lysine codons in said coding region arc 
AAA and about 35 of said lysine codons are AAG; 

\(4ierem about 22 of the 47 aspartic add codons in said coding region 
IS are GAT and about 25 of said aspartic acid codons are GAC; 

wherein about 21 of the 5 1 glutamic acid codons in said coding region 
are GAA and about 30 of said glutamic acid codons are GAG; 

wherein the 7 tryptophan codons in said coding region are TGO; 

wherein about 2 of the 29 arghnne codons in said coding region are 
20 CGT, about 6 of said arginine codons are CGC, about 3 of said aiginine 

codons are CGA, about 6 of said arginine codons are CGG, about 6 of said 
arginine codons are AGA, and about 6 of said arginine codons are AGG; and 
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whradn about 6 of the 36 glydne codons in said coding region are 
GGT, about 12 of said glycine codons are GQC, about 9 of said glycine 
codons are GGA, and about 9 of said glycine codons are GGG. 

57. The polynucleotide of claim 56, wbtoreui said nucleic acid 
fragment encodes a polypeptide at least 95% identical to amino acids 30 to 
764ofSEQIDNO:4, 

58. The polynucleotide of claim 56, wherein the codons in said 
nucldc add fragment corresponding to amino acids 192 to 197 of SEQ ID 
N0:4 are deleted. 

59. The polymicleotide of claim 58, wherein said nucleic add 
fragment encodes amino adds 24 to 752 of SEQ ID N0:8. 

60. The polynucleotide of claim 59, which comprises nucleotides 
82to2268ofSEQIDNO!7. 

61. The polynucleotide of claim 56, wherdn the asparagjne codons 
in add nucldc acid fragment corresponding to amino acids 39, 153, 275, 321, 
357, 417. 505, 538, 599, 650, 693, and 738 of SEQ ID N0:4 are deleted and 
each of said asp aragine codons is rq)laced with a codon Vfkdct codes for an 
amino acid other than asparagine. 
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62. The polynucleotide of claim 61, wherein the asparagine codons 
in said nucldc add fragment corre^nding to amino adds 39, 153, 275, 321, 
357, 417, 505, 538, 599, 650, 693, and 738 of SBQ ID N0:4 are deleted and 
fi^ rih of said asparagine codons rq>laced with a codon whidi codes for 

5 gjiutamme. 

63. The polymicleotide of claim 58, wbsxeia fhe asparagine codons 
in said nucldc add fragment corresponding to amino adds 39, 153, 275, 321, 
357, 417, 505, 538, 599, 650, 693, and 738 of SEQ ID N0:4 are deleted and 

10 eachof said asparagine codons is replaced with a codon "V^c^ for an 

amino add other than asparagine. 

64. The polynucleotide of claim 63, wherein tiie asparagine codons 
in said nucldc add j&agment coiresponding to amino adds 39, 153, 275, 321, 

15 357, 417, 505, 538, 599, 650, 693, and 738 of SEQ ID NO:AP are deleted and 

each of said asparagine codons replaced with a codon which codes for 
glutamine. 

65. The polynucleotide of any one of claims 56-64, wherdn sdd 
20 nucldc add fragment is figated to a heterologous nucldc add. 
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66. The polynucleotide of claim 65, wherein said heteobgous 
nucleic acid encodes a heterologous polypq^tide fused to the polypq>tide 
encoded by said nucleic add fiagment 

67. The polynucleotide of claim 66, wherein said heterologous 
polypeptide is a secretory signal pqptide. 

68. The polynucleotide of claim 67, wherein said signal pqptiLde is a 
human tissue plasminogen activator (hTPA) signal peptide. 

69. The polynucleotide of any one of claims 56-68, which is DNA, 
and wherein said nucleic acid fiagment is operably associated with a promoter. 

70. The polynucleotide of any one of claims 56-68, which is RNA. 

71. Hie polynucleotide of claim 70, which is messeaogerRNA 
(mBNA). 

72* A vector comprising the polynucleotide of any one of claims 

56-69. 



73. The vector of claim 72, which is a plasmid. 
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74. A composition coiiq)risiDg the pol^ucleotide of any one of 
daims 56-71, and a carder. 

75. The CQo^osition of claim 74, iiirther comprising a component 
selected fiom fhe group consisting of an adjuvant, and a transaction 
facilitating conipoimd. 

76. The conqiosition of claim 75, ^^^lerein said adjuvant is selected 
from the gioiip consisting of: 

(db)-N-(3-aminopropyl)-N^-dimefhyl-2,3-bis(^ 
l-propanaminium bromide (GAF-DMORIE) and a neutral lipid; 
a cytokine; 

mono-phosphoryl Iqnd A and trehalosedicoiynomycolateAF (MPL + 

TDM); 

a solubilized monoi>hosphoiyl Hpid A formulation; and 
CRL1005/BAK. 

77. The composition of claim 76, herein said adjuvant 
comprises(db).N-(3-aminopn^yl)-N,N^iimeth)^^ 

tetradeceneyloxy)-l-propanaminimn bromide (GAP-DMORIE), and Mvfaerein 

said neutral lipid is selected &om the group consisting of: 

l,2«dioleoyl-^-glycero-3-phosphoethanolaTnine (DOPE), 
l,2-diphytano3dnS7i-glycero-3-phosphoefhanolamine (DPyPE), and 
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l^Hlimyiisto>d-gIycer-3-pho8phoetfaano^ (DhSFE). 

78. The cotnpositiQn of claim 77» whemn said neutral lipid is 
DPjPE. 

5 

79. The coiK3>osition of claim 75, comprising the transfection 
facilitating oompoimd (±>N<2-hytlit>xyeth)l>Nj4-d^ 
bi8(teti:adecyloxy)-l-piopananiimimifaiondd^^ (DMRIB). 

10 80. Amefhodtotreatorpreventantfai^in&cticmmayertdn^ 

con^nising: administering to a vertd)rate in need thereof the composition of 
any one of claims 74-79. 

81. An isolated polynucleotide comprising a nucleic acid firagment 
IS which encodes at least SO contiguous amino adds of SBQ ID K0:12, wherein 

said nucleic acid firagjtnent is apoition of an optimized coding region for Ihe 
polypeptide of SBQ ID N0:12; 

\vherem about 13 of the 29 phenylalanine codons in said coding region 
are TTT and about 16 of said phen^alanine codons are TTC; 
20 wherem about 6 of the 80 leucine codons in said coding tegicm are 

TTA, about 10 of said leucine codons are TTG» about 10 of said leucine 
codons are CTT» about 16 of said leucine codons are CTC, about 6 of said 
leucine codons are CIA, and about 32 of said leucine codons are CTG; 
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whereiii about 26 of ttie 74 isoleucme codons in said coding region are 
ATT» about 36 of said isoleucine codons are ATC, and about 12 of said 
isoleodns codons are ATA; 

"herein the 10 methionine codons in said coding region are ATC^ 

"VTberem about 7 of die 40 valine codons in said coding region are GTT^ 
about 9 of said valine codons are GTG» about 5 of said valine codons are 
GTA, and about 19 of said valine codons are GTG^ 

wberdn about 10 of the 54 serine codons in said coding region are 
TCT, about 12 of said serine codons are TCC^ about 8 of said serine codons' 
are TCA, about 3 of said serine codons are TCG, about 8 of said serine codcms 
are AGT» and about 13 of said s^ine codons are AGC; 

wherein about 6 of the 21 proline codons in said coding region are 
CCT, about 7 of said proline codons are CCC, about 6 of said proline codons 
are CCA, and about 2 of said proline codons are CCG; 

herein about 7 of the 28 threonine codons in said coding region are 
ACT, about 10 of said threonine codons are ACC, about 8 of said threonine 
codons are ACA, and about 3 of said threonine codons are ACG; 

wherein about 9 of the 34 alanine codons in said coding region are 
GGT» about 14 of said alanme codons are GCC, about 8 of said alanme codons 
are OCA, and about 3 of said alanine codons are GCG; 

herein about 15 of the 35 tyrosine codons in said coding region are 
TAT and about 20 of said tyrosine codons m TAC; 
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wherein about 9 of the 21 histidine codons in said coding region are 
CAT and about 12 of said histidine codons are CAC; 

\vlieicdn about 10 of tiie 41 g^utamine codons in said coding r^on are 
CAA and about 31 of said ghjtaioine codons are CAO; 

wherdn about 25 of the 54 aspaiagine codons in said coding region are 
AAT and about 29 of said asparagine codons are AAC; 

whoein about 36 of the 86 lysine codons in said coding region are 
AAA and about 50 of said lysine codons are AAQ; 

wherein about 25 of the 55 a^artic acid codons in said coding region 
are GAT and about 30 of said aspartic acid codons are GAC; 

wherein about 33 of the 79 gtutamic acid codons in said coding region 
are GAA and about 46 of said ghitaniic acid codons are GAG; 

wherdn the single cysteine codon in said coding region is selected 
fix>m the group consisthig of TGT and TGC; 

wherein the 5 tiyptc^han codons in said coding region are TGC^ 

wherein about 2 of the 27 arginine codons in said coding region are 
CGT, about 5 of said argmine codons are CGC, about 3 of said arginine 
codons are CGA« about 6 of said argjnine codons are CGG, about 6 of said 
aighiine codons axe AGA, and about 5 of said argmine codons are AGG; and 

wherein about 6 of the 35 glydne codons in said coding region are 
GGT, about 12 of said glycine codons are GGC, about 8 of said glycine 
codons are GGA, and about 9 of said glycine codons are GGO. 
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82. The polynucleotide of claim 81, wherein said cysteine codon in 
said coding region is TOT. 

83. The polynucleotide of claim 81, wherein said cysteine codon in 
said coding region is TGC. 

84. The polynucleotide of claim 81, ^^lerein said nucleic acid 
firagment encodes at least 100 contiguous amino acids of SEQ n> NO: 12. 

85. The polynucleotide of claim 84, wherrai said nucleic acid 
fragment encodes amino adds 34 to 809 of SEQ ID NO: 12. 

86. Ihe polynucleotide of claim 85, wherein said nucleic acid 
fiagmeot comprises nucleotides 99 to 2427 of SEQ ID NO:26. 

87. The polynucleotide of claim 81, wherein said nucleic acid 
jB:agmmt mcodes amino adds 34-583 of SEQ JD NO:12. 

88. The polynucleotide of claim 81 , wherein said nucleic acid 
fragment racodes amino adds 34-254 of SEQ ID NO:12. 

89. The polynucleotide of claim 81, wfaerem said nucleic add 
fragment encode amino acids 34-295 of SEQ ID NO:12. 
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90. The polynucleotide of any one of claims 81-89, whoiein said 
nncleic add firagment i$ ligated to a hetaologpus nucleic acid. 

9L The polyniicleotidd of claiin 90, wherein said heterologous 
nucleic add encodes a heterologous polypq)tide fiised to the polypeptide 
encoded by said nucldc add fiagment 

92. Hie polynucleotide of claim 91, wherein said heterologous 
polypeptide is a secretoiy signal peptide. 

93. The polynucleotide of claim 92, wfaerdn said signal peptide is a 
human tissue plasminogw activator (hTPA) signal peptide. 

94. The polynudeotide of any one of claims 81-93, which is DNA, 
and wherein said nucldc add fragment is operably assodated with a promoter. 

95. The polynucleotide of any one of daims 81-93, which is RNA- 

96. The polynucleotide of claim 95, wfaidi is m^senger RKA 
(mRNA). 
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97. A vector coiDcqprisbag the polynucleotide of any one of claims 

81-.94. 

98. The vector of claim 97, which is a plasmid. 

99. A composition conq)nsing the polynucleotide of any one of 
claims 81-96, and a carrier. 

100. The composition of clann 99, further conq;>rising a conqKmeat 
selected &om the groiqp consistiiig of an adjuvant, and a transfection 
&cilitating compound. 

101 . The composition of claim 100, wherdn said adjuvant is 
selected fiom the ffxmp consisting o£ 

(db).N'<3-aminoprop:^NJT<limeth3d^^^ 
l^^ptopanaminium bmniido (GAF-DMORIE) and a neutral lipid; 
a cytokine; 

mono-phosphoryl lipid A and trehalosedicorynomycolateAF (MPL + 

TDM); 

a solubilized mono-phosphoiyl lipid A formulation; and 
CRL1005/BAIC 
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102. Tlie conq>osition of claim 101, wherein said adjuvant 
con^)Tises(db)-N*^3'-azmnopropyl)-N,N-di^ 

tetradficmeyloxy)-l-propanimnmuni bromide (GAP-DMORIE)» and wfaerdn 

said neabral t^id is selected from the gtoq) consisting of: 

l^-dioleoyI-^n-glyceto-3^hosphoettianola^^ (DOPBX 
l^-diphytanoyi-5n-glycero-3*phosplK)e1faanok^^ (DPyPB), aod 
l^-dimyristo^-gLycer-^S-phosphoethanolamine (DMPE)« 

103. The compositioa of claim 102, wfaerem said neutral lipid is 
DPyPE, 

104. The conq)osition of claim 100, conqxrising the tranafection 
fedlitating conrpoimd (±>N-<2-hydhx)xyethyl)-N,N-dimefhyl-2,3- 
bis(tetFBdecyioxy)*-l-propanan]inium bromide) 0>MRIE), 

105. A method to treat or prevent anthrax infection in a v^tebrate 
conq3iising: administedng to a vertebrate in need thereof the composition of 
any one of claims 99-104. 

106. An isolated polyxmcleotide comprising a nucleic add fragment 
which encodes apolypeptide at least 90% identical to amino acids 34 to 809 of 
SEQ ID NO:12, wherein said nucleic add fr^^ieot is a variant fragment of an 
optimized coding region for the polypeptide of SEQ ID NO: 12; 
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^erein about 13 of the 29 pbeaylalanme codons in said coding region 
are TTT and abont 16 of said phenylalanine codons are TTC; 

wfa^rrin abont 6 of Ihe 80 leucine codons in said coding region are 
TTA, about 10 of said leucine codons axe TTG, about 10 of said leucine 
codons are CTT, about 16 of said leucine codons are CTC, abont 6 of said 
leucine codons are CTA, and about 32 of said leucine codons aie CTG; 

wberdn about 26 of the 74 isoleudne codons in said coding region are 
ATT, about 36 of said isoleucme codons are ATC, and about 12 of said 
isoleucine codons are ATA; 

T^a:ein the 10 methionine codons in said coding region are ATG; 

wherein about 7 of the 40 valine codons in said coding region are GTT» 
about 9 of said valine codons are GTG» about 5 of said valine codons are 
GTA, and about 19 of said valine codons are GTG; 

wherein about 10 of the 54 serine codons in said coding reg^n are 
TCT, about 12 of said serine codons are TCC» about 8 of said serine codons 
are TCA, about 3 of said serine codons are TCG, about 8 of said serine codons 
are ACT, and about 13 of said serine codons are AGC: 

wherein about 6 of the 21 proline codons in said coding region are 
CCt, about 7 of said proline codons are CCC, about 6 of said proline codons 
areCCA^ and about 2 of said proline codons are CCG; 

whoein about 7 of the 28 threonine codons in said coding region are 
ACT, about 10 of said threonine codons are ACQ about 8 of said threonine 
codons are ACA, and about 3 of said threonine codons are ACG; 
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wheiein about 9 of the 34 alanine codons in said coding region are 
GGTy about 1 4 of said alanine codons are GCC, about 8 of said alanine codons 
are GCA, and aboid 3 of said alanine codons axe GCG; 

wherein about IS of the 3S tyrosine codons in said coding region are 
TAT and about 20 of said tyrosine codons are TAC; 

wherein about 9 of the 21 histidine codons in said coding region are 
CAT and about,12 of said histidine codons are CAC; 

^^Aierein about 10 of the 41 gtutsmine codons in said coding region are 
CAA and about 31 ofsaidglutamine codons are CAG; 

^(^leiein about 25 of the 54 asparagine codons in said coding region are 
AAT and about 29 of said asparagine codons are AAC; 

wherein about 3 6 of the 86 lysme codons in said coding region are 
AAA and about SO of said lysine codons are AAG; 

wherein about 25 of the 55 aspartic acid codons in said coding region 
are GAT and about 30 of said aspartic add codcms are GAC; 

herein about 33 of the 79 glutamic acid codons in said coding region 
areGAA and about 46 of said glutamic acid codons are GAG; 

wherein the single cysteine codon in said coding region is selected 
fiom the groiq> consisting of TGT and TGC; 

wherein flie 5 tryptophan codons in said coding region are TGG; 

wherein about 2 of the 27 arginine codons in said coding region are 
CXjT, about 5 of said arginiae codons are CGC, about 3 of said arginine 
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codcms are CGA, about 6 of said arginme codons are CGG, about 6 of said 
aiginmecodoDsaieAGAyandaboutSofsaidargio^ and 

wherein about 6 of the 35 glycine codons in said coding region are 
GGT, about 12 of said glycine codons are GGC» about 8 of said glyciae 
codons are GGA, and about 9 of said glycine codons are GGG. 

107. Hie polynucleotide of claim 106, wherein said cysteine codon 
in said coding region is TGT. 

108. The polynucleotide of claim 106, ^(dierein said cysteine codon 
in said coding region is TGC* 

109. The polynucleotide of claim 106, wherein nucleic acid 
fiagment encodes a polypq)tide at least 95% identical to amino adds 34 to 
809ofSEQIDNO:12. 

110. The polynucleotide of claim 106, wherem the histidine codons 
in said nucleic acid fiagment corresponding to positions 719 and 723 of SEQ 
ID N0:12 are deleted and each is replaced with a codon whidi codes for an 
amino acid other than histidine, and wheiein the glutamic add codon in said 
nucldc acid fragment coire^^onding to position 720 of SEQ ID NO: 12 is 
deleted and rq)laced with a codon whidi codes for an amino acid other than 
glutamic add. 
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111. Tbe polynudeotLde of claim 110, wherein the histidine codons 
in said nucleic add j&agment con^ondiiig to positions 719 and 723 of SEQ 
ID N0:12 are delved and each is rqilaced with an alanine codon selected 
S fimn the ffoxsp consisting of GCT, GCC, GCA, and GCO, andi^ereinlfae 

glutamic add oodon in said nucldc add firagment corresponding to portion 
720 of SEQ ID NO:12 is deleted and replaced with an aspartic add codon 
selected fiom the group consisting of GAT and GAC 

10 1 12. The polynudeotide of claim 1 1 1, which conqnises nucleotides 

82to2409ofSEQroNO:9. 

1 13. The polynacleotide of claim 106, wherein the asparagxne 
codons in said nucldc acid jGragment corresponding to positions 62, 212, 286, 

1 5 478, 712, 736, and 757 of SEQ ID N0:12 are deleted and eadx is replaced 

with a codon which codes for an amino acid other than asparagine. 

1 14. Ihe polynucleotide of claim 1 13, wherein the asparagine 
codons in said nucldc add fiagment corresponding to podtions 62, 212, 286, 

20 478, 712, 736, and 757 of SEQ ID N0:12 are deleted and eadi is replaced 

with a glutamine codon selected from the groiq) consisting of CAA and C AG. 
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115. Thepolynucleotide of claim 110, wherraillieaspaiagine 
codons in said ixucldc acid firagment carresposding to positioos 62, 212, 286, 
478, 712, 736, and 757 of SEQ ID N0:12 are ddeted and eadi is replaced 
with a codon wbicli codes for an amino acid olfaer than asparag^e. 

5 

1 16. The polynucleotide of claim 115, wherein the aspaiagine 
codons in said nncldc acid finagment corresponding to positions 62, 212, 286, 
478, 712, 736, and 757 of SBQ ID N0:12 are deleted and each is replaced 
with a glutamine codon selected fiom the groiq) consistiDg of CAA and CAG. 

10 

1 17. The polynucleotide of claim 1 16, wherem said glutamine codon 
is CAA. 

118. The polynucleotide of claim 116, wherein said glutamine codon 
15 isCAa 

1 19. The polynucleotide of claim 116, which comprises nucleotides 
82 to 2409 of SEQ ID NO:19. 

20 120. The polynucleotide of any one of claims 106-1 19, wherein said 

nucleic acid fiagmeat is Ugated to a heterologous nucleic acid. 
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121. Hie polynucleotide of claim 120, Western said heterologous 
nucleic acid encodes a hetecolo^us polyp^tide fused to the polypqitide 
encoded by said nucleic add fragment 

122. The polynucleotide of claim 121» wherein said heterologous 
polypeptide is a secretory signal peptide. 

123. The polynucleotide ofclaim 122, wherein said signal peptide is 
a human tissue plasminogen activator (hTPA) signal peptide. 

124. The polynucleotide of any one of claims 106-123, which is 
DNA, and wherein said nucleic acid fiagmeaot is op mbly associated with a 
promoter. 

125. The polynucleotide of any one of claims 106-123, which is 

RNA. 

126. The polynucleotide of claim 125, which is messenger RNA 
(mBNA). 



127. 
106-124. 



A vector conqirising the polynucleotide of any one of claims 
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128. Hie vector of claim 127, vMch is a plasmid. 

129. A composition coiEqpnsixig tibie polynucleptide of any one of 
claims 106-126» and a carrier. 

130. Hie composition of claim 129, further compnsmg a component 
selected! fixnn the groiqp consisting of an adjuvant, and a transfection 
ficilitating conqpomid. 

131. The composition of claim 130, Therein said adjuvant is 
selected fixmi the group consisting of: 

(db)-N<3-ammopropyl)-NJ^-<iimethyl-2;3-bi<^ 
1-propanaminium bromide (GAP-DMORIE) and a neutral lipid; 
a cytokine; 

mono-pho^horyl lipid A and trehalosedicorynomyoolateAF (MPL 4* 

TDM); 

a solubilized mono-phosphoi^ lipid A formulation; and 
CaJL1005/BAK. 

132. The composition of claim 131, T^ierem said adjuvant 
coinprises(±)-N-(3-aminopropyl>NJ4^-dimethyl-23-bis(^^ 
tetradecene^oxy)-l-propanaminium bromide (GAP-DMORIE), and wherein 
said neutral lipid is selected fiom the groiqp consisting of: 
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l^-dioleoyl'^-glycerD'3-phosphoelhanolaTnme (DOPE), 
l^-diphytanoyl--^-glycerO''3"phoyhoe<han (DPyPE), and 

lyZ-dimyristoyi-glyaer-S-pbospho^ian^ (DMPE). 

133. The conqMJsition of claim 132, herein said neutral lipid is 
DPyPB. 

134. The con^joaition of claim 130» comprising fhe transfecticm 
&cilitating compound (±)-N-(2-hydioxye1hyl)-N«N-dim6lhyl-2»3*' 
bi8(tebadecyloxy)-l-p!Fopanaz^ bromide) (DMRI^. 

135. A me&od to treat cor prevent anthrax infection in a vertd)ntte 
comprising: administering to a v^lebiate in need thereof fhe composition of 
any one of claims 129-134. 
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1 gatatcgoca ccatggabgc aatgaagaga gggctctgct gtgtgctgct gctgtgtgga 

md amkr glc cvl llcg 

61 gcagtcttcg tttcgcccag cagcgctggg ccaactgtgc ccgacagaga caatgatgga 

av£ vsp ssag ptv pdr dndg 

121 atccctgata gtctagaggt tgagggataa acggtagatg tcaagaacaui aaggactttt 

ipd sle vegy tvd vkn krtf 

181 ctctcgcott ggatctcaaa tatccatgag aagaaggggc ttacoaagta caagtcctcc 

lap wis nihe kkg Itk ykas 

241 cccgagaagt ggtctacogc ttccgatoca tatagcgatt tcgagaaggt cacaggccgg 

pek W8t aadp ysd fek vtgr 

301 atcgataaaa atgtgtctcc agaggctaga cacoccctgg tagcagccta cccgattgta 

idk nva pear hpl vaa yplv 

361 cacgtggaca tggagaacat cattctaagc aaaaaogagg accagtccajc acaaaacact 

hvd men ilia kne dqs tqnt 

421 gactccgaga cccgcaccat atctaaaaac accagtactt caaggaccca cacctctgaa 

dse t.rt iskn tat art htse 

481 gtgcacggca atgcggaagt ccatgcatcg tttttcgata ttggtggctc cgtgtcagcc 

vhg nae vhas ffd igg svsa 

541 ggctttagca atagcaactc ctcgacggtt gccattgaoc actcactgtc attagcaggt 

gfa nan s etv aid hsl slag 

601 gagaggactt gggctgaaac tatgggtotg aataccgccg atacggcccg gctcaacgca 

ert was tmgl nta dta r^lna 

661 aatattcggt acgtcaacac agggactgct cctatatata acgtgctgcc taogacaagt 

nir yvn tgta piy.nvl ptta 

721 cttgtccbgg gcaaaaatca gaccctcgca accattaagg oaaaggaaaa tcagctgagc 

Ivl gkn qtla tik^ake nql a 

781 cagatcctcg cccctaacaa ctattatcca tccaaaaatt tagcccccat agccctgaac 

qil apn xiyyp skn lap lain 

841 gcccaggacg acttttcctc tacccccata actatgaatt acaatcagtt cctggagctg 

aqd dfs stpl tmn ynq flel 

dOl gaaaagacga agcagctgag actagacacc gatcaggtgt atggaaacat agcgacatat 

ekt kql rldt dqv ygn iat y 

961 aactttgaga acggccgcgt gcgcgtcgac actgggtcaa actggtctga agttctgccg 

nf*e ngr vrvd tgs nws evlp 

1021 caaattcaag agacaaccgc cagaattatc tttaatggga aggacttgaa ccttgtcgaa 

qiq ett aril £ng kdl nlve 
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Figure XB 

1081 cgtagaattg ccgccgtgaa ccccagtgat ccactcgaga cgactaaacc ggatatgaca 
rri aav npsd pie ttk pdmt 



1141 ctgaaagagg ctctgaagat tgccttcgga ttcaacgaac ctaatggcaa tttgoagtat 
Ike alk lafg fne png nlqy 



1201 caggggaaag acatcacaga gtttgatttc aatttogatc agcagacttc ccaaaatatc 
qgk dlt e€d£ nfd qqt aqni 



1261 aaaaatcagt tggcagagct gaatgccacc aatatctaca cggttctcga taaaatcaaa 
knq lae Inat nly tvl dkik 



1321 Gttaacgcca agatgaacat attgattcga gacaaacgct tccactacga cogcaacaat 
Ina kmn illr dkr fhy drnn 



1381 atagccgtag gcgctgatga gtctgtcgtc aaggaggota atagggaagt tatcaacagc 
lav gad esv v kea hre vine 

1441 agtactgaag ggctgttact taatatcgac aaggacattc ggaagatcct gtccgggtat 
flte gll laid kdi rki Isgy 

1501 atcgtggaga tcgaggatac cgagggcctg aaggaagtca ttaacgaccg ctatgatatg 
ive led tegl kev ind rydm 

1561 otgaacattt ccagcttacg acaggacggt aagaoattta ttgaotttaa aaagtataac 
Ini cfll rqdg ktf idf kkyn 

1621 gacaagctac ccctgtacat ttccaaccca aattacaaag ttaatgtgta tgctgtaacc 
dkl ply isnp nyk vnv yavt 

16 8X aaggagaaca caatcatcaa tccaagogag aacggcgata ccagcacaaa tggaatcaiaa 
ken tli npse ngd tst nglk 



1741 aagatoctta batttagtaa aaaaggctac gagatcggtb gaggatoc 
kil iffl kkgy eig - 
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Figure 2A 

I gatatcgcca ccatggatgc aatgaagaga gggctctgct gtgtgctgct gctgtgtgga gcagtcttcg 
rad amJcr glc cv^l IXcg avf 

71 .tttcgcccag cagcgctggg ccaactgtgc ccgacagaga caatgatgga atccctgata gtctagaggt 
vsp ssag ptv pdr dndg ipd sXe 

141 tgagggatac acggtagatg tcaagaacaa aaggactttt ctctcgcctt ggatctcaaa tatccatgag 
vegy tvd vkn krtf Isp wis nihe 

211 aagaaggggc ttaccaagta caagtcctcc cccgagaagt ggtctaccgc ttccgatcca tatagcgatt 
kkg Itk yksa pek wat asdp ysd 

281 tcgagaaggt cacaggccgg atcgataaaa atgtgtctcc agaggctaga caccccctgg tagcagccta 
£e'k vtgr idk nvs pear hpl vaa 

351 cccgattgta cacgtggaca tggagaacat cattctaagc aaaaacgagg accagtccac acaaaacact 
ypiv hvd nea ills kne dqs tqnt 

421 gactccgaga cccgcaccat atctaaaaac accagtactt caaggaccca cacctctgaa gtgcacggca 
dae trt iskn tat art htae vhg 

491 atgcggaagt ccatgcatcg gatattggtg gctccgtgtc agccggcttt agcaatagca actcctcgac 
nae vhaa dig gav.aagf a n s nas 

561 ggttgccatt gaccactcac tgtcattagc aggtgagagg acttgggctg aaactatggg tctgaatacc 
tvai dhs Isl agar twa etia glnt 

631 gccgatacgg cccggctcaa cgcaaatatt cggtacgtca acacagggac tgctcctata tataacgtgc 
adt arl nani ryv ntg tap! ynv 

701 tgcctacgac aagtcttgtc ctgggcaaaa atcagaccct cgcaaccatt aaggcaaagg aaaattagct 
*lpt tslv Igk'aqt lati kak enq 

771 gagccagatc ctcgccccta acaectatta tccatccaaa aatttagccc ccatagccct gaacgcccag 
Isqi lap nny ypak nla pia Inaq 

841 gacgactttt cctctacccc cataactatg aattacaatc agttcctgga gctggaaaag acgaagcagc 
ddf sst pitm nyn qfl elek tkq 

911 tgagactaga caccgatcag gtgtatggaa acatagcgac atataacttt gagaacggcc gcgtgcgcgt 
Irl dtdq vyg nia tynf eng rvr 

981 cgacactggg tcaaactggt ctgaagttct gccgcaaatt caagagacaa ccgccagaat tatctttaat 
vdtg snw sev Ipqi qet tar iifn 

1051 gggaaggact tgaaccttgt cgaacgtaga attgccgccg tgaaccccag tgatccactc gagacgacta 
gkd Inl verr iaa vnp sdpl ett 

I 121 aaccggatat gacactgaaa gaggctctga agattgcctt cggattcaac gaacctaatg gcaatttgca 
kpd mtlk eal kia fgfn epn gnl 

1191 gtatcagggg aaagacatca cagagtttga tttcaatttc gatcagcaga cttcccaaaa tatcaaaaat 
qyqg kdi tef dfnf dqq taq nikn 

1261 cagttggcag agctgaatgc caccaatatc tacacggttc tcgataaaat caaacttaac gccaagatga 
qla eln atni ytv Idk ikln akm 

1331 acatattgat tcgagacaaa cgcttccact acgaccgcaa caatatagcc gtaggcgctg atgagtctgt 
n il irdk rfh ydr nnia vga des 

1401 cgtcaaggag gctcataggg aagttatcaa cagcagtact gaagggctgt tacttaatat cgacaaggac 
7vke ahr evi nast egl lln idkd 

1471 attcggaaga tcctgtccgg gtatatcgtg gagatcgagg ataccgaggg cctgaaggaa gtcattaacg 
irk lis gyiv eie dte glke vin 
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Figure 2B 

1541 accgctatga tatgctgaac atttccagct tacgacagga cggtaagaca tttattgact ttaaaaagta 
dry dmln las Irq dgkt fid fkk 

1611 taacgacaag ctacccctgt: acatttccaa cccaaattac aaagttaatg tgtatgctgt aaccaaggag 
yndk Ipl yia npny kvn vya vtke 

1681 aacacaatca tcaatccaag cgagaacggc gataccagca caaatggaat caaaaagatc cttatattta 
ntl inp seng dts tng ikki li£ 

1751 gtaaaaaagg ctacgagatc ggttgaggat cc 
skk gyei g- 
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1 gatatcgcca ccatggatgc aatgaagaga gggctctgct gtgtgctgct gctgtgtgga 
md amkr glc cvl llcg 

61 gcagtcttcg tttcgcccag cgaagtgaag -caagaaaatc gacttctgaa cgagagcgaa 
avf vap sevk qen rll nese 

V 

121 agttcatcac agggtcttct cggatactac ttcagtgact tgaatttcca agoaccaatg 
a B B qgl Igyy fsd Inf qapm 

IBl gtggtgacta gtagcaccac cggcgatttg ageattccca gctctgagtt ggagaacatt 
vvt sat tgdl sip sse leni 

241 cccagcgaaa atcagtactt ccagtctgct atctggtccg gattcattaa ggttaaaaag 
pse nqy fqsa iws gfi kvkk 

301 tccgacgaat atacatttgc tacctcggcg gataaccatg tgacaatgtg ggtggaogac 
fide ytf atsa dnh vtm wvdd 

361 caggaagtga tcaacaagge ttcaaacfcct aataaaatcc ggctcgagaa ggggaggctc 
qev ink aans nki rlo kgrl 

421 taccagatca aaattcagta ccagcgggaa aaccctacag aaaaaggact cgatttcaag 
yqi kiq yqre npt ekg Idf k 

481 ctgtactgga cagatagcc^ aaacaagaaa gaagttatca gctoagacaa tctgcagtta 
lyw tds qnkk evi asd nlql 

541 cccgagctca agcagaagag ttctaataca agcgctgggc caactgtgcc cgacagagac 
pel kqk ssnb sag ptv pdrd 

601 aatgatggaa tccctgatag tctagaggtt gagggataca cggtagatgt caagaacaaa 
ndg ipd alev egy tvd vknk 

661 aggacttttc tctcgccttg gabctoaaat atccatgaga agaaggggct taccaagtac 
rtf lap wian ihe kkg Itky 

721 aagtcctccc ccgagaagtg gtctaccgct tccgatccat atagcgattt cgagaaggtc 
ksa pek wata adp ysd fekv 

781 acaggccgga tcgataaaaa tgtgtctcca gaggctagac accccctggt agcagcctac 
tgr idk nvsp ear hpl vaay 

841 ccgattgtac acgtggacat ggagaacatc attctaagca aaaacgagga ccagtccaca 
piv hvd meni ils kne dqat 

901 caaaacactg actccgagac ccgcaccata tctaaaaaca ccagtacttc aaggacccac 
qnt dse trti skn tat arth 

961 acctctgaag tgcacggcaa tgcggaagtc catgcatcgt ttttcgatat tggtggctcc 
tse vhg naev has ffd iggs 

1021 gtgtcagccg gctttagcaa tagcaackcc tcgacsggttg ccattgacca ctcactgtca 
vsa gfs nsns etv aid hale 
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1081 ttagcaggtg agaggacttg ggctgaaact 
lag ert waet 

1141 ctcaacgcaa atattcg g ta cgtcaacaca 
Ina nir yvnt 

1201 acgacaagtc ttgtcctggg caaaaatcag 
tts Ivl gfcng 

1261 cagctgagco agatcctcgc ccctaacaac 
qls qil apnn 

1321 gocotgaaog cccaggacga cttttcctct 
aln aqd dfss 

1381 ctggagctgg aaeiagacgaa gcagctgaga 
lei ekt kqlr 

1441 gcgacatata actttgagaa cggccgcgtg 
aty nfe ngrv 

1501 gttctgccgc aaattcaaga gacaaccgcc 
vlp qiq etta 

1561 cttgtcgaac gtagaattgc ogcogtgaac 
Ive rri aavn 

1621 gatatgacao tgaaagaggc tcbgaagatt 
dmt Ike alki 

1681 ttgcagtatc aggggaaaga catcacagag 
Iqy qgk dite 

1741 oaaaatatca aaaatcagtt ggcagagctg 
qni knq lael 

1801 aaaatcaaac ttaacgccaa gatgaacata 
kik Ina kmni 

1861 cgcaacaata tagccgtagg cgctgatgag 
rnn lav gads 

1521 atcaacagca gtactgaagg gcbgttactt 
Ina ate gill 

1981 tccgggtata tcgtggagat cgaggatacc 
agy ive iedt 

2041 tatgatatgc tgaacatttc cagcttacga 
ydm Ini sslr 

2101 aagtataacg acaagctacc cctgtacatt 
kyn dkl plyi 



PCT/US2003/028199 



atgggtctga ataccgccga tacggcoogg 
rogl nta dtar 

gggactgctc ctatatataa cgtgctgcct 
gta piy nvlp 

accctcgcaa ccattaaggc aaaggaaaat 
tla tik aken 

tattatccat ccaaaaattt agcccccata 
yyp Bkn lap! 

acccccataa ctatgaatta caatcagttc 
tpi tmn ynqf 

ctagacaccg atcaggtgta tggaaacata 
Idt dqv ygni 

cgcgtcgaca ctgggtcaaa ctggtctgaa 
rvd tgs nwse 

agaattatct ttaatgggaa ggacttgaac 
rii fng kdln 

cccagtgatc cactcgagao gactaaaccg 
pad pie ttkp 

gccttcggat tcaacgaacc taatggcaat 
afg fne pngn 

tttgatttca atttcgatca gcagacttcc 
fd£ nfd qqts 

aatgccacca atatctacac ggbtctcgat 
nat niy tvld 

ttgattcgag acaaacgctt ccactacgac 
lir dkr fhyd 

tctgtcgtca aggaggctca tagggaagtt 
svv kea hrev 

aatatcgaca aggacattcg gaagatcctg 
nid kdi rkil 

gagggcctga aggaagtcat taacgaccgc 
egl kev indr 

caggacggta agacatttab tgactbtaaa 
qdg ktf idfk 

tccaacccaa attacaaagb taabgtgtat 
anp nyk vnvy 
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Figure 3C 

2161 gctgtaacca aggagaacac a^tcatcaat ooaagcgaga acggcgatac cagcacaaat 
avit ken tiin pee ngd tetn 

2221 ggaatcaaaa agatccttat atttagtaaa aaaggotacg agatcggttg aggatcc 
gik kil ifsk kgy eig 
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1 gatatcgcca ccatggatgc aatgaagaga gggctctgct gtgtgctgct gatgtgtgga 

end arakr glc cvl llcg 

61 gcagtcttcg tttcgcccag cgccggcggg catggggacg ttggcatgca tgtgaaagaa 
avf vep aagg hgd vgtn hvke 

121 aaggagaaaa acaaggacga aaacaagogt aaagacgaag aacgtaataa aacacaggag 
kek nkd enkr kde ern ktqe 

181 gaacacbtaa aggagatcat gaagcacata gtaaagattg aggtaaaagg cgaagaggct 
ehl kel mkhi vki evk geea 

241 gtaaagaagg aggcagcaga aaaactgttg gagaaggtgc cttctgacgt cttagagatg 
vkk eaa ekll ekv pad vlera 

301 tataaggcca tcggcggtaa gatctatatc gtggacggag acatcactaa acacatiatct 
yka igg kiyi vdg dit khis 

361 ctcgaagctc tctccgagga caagaaaaag attaaagaca tctacgggaa ggatgcctta 
lea Ise dkkk ikd iyg kdal 

421 ttgcacgagc actacgttta cgoaaaggag ggetatgagc ccgtgctcgt tattcsagagt 
Ihe hyv yake gye pvl viqa 

481 agtgaggact acgtcgagaa taccgagaaa gctctgaatg tgtattacga gatcg^aaag 
sed yve ntek aln vyy eigk 

541 attctgtccc gggaaatcct gtccaaaatc aaccagccat aocagaaatt ccttgatgtt 
lie rdi laki nqp yqk fldv 

601 ctbaacacaa tcaaaaacgc gtcagatagc gaogggcagg atcttctgtt tacaaatcaa 
Int Ikn asda dgq dll ftnq 

661 ctcaaggaac accccactga tttcagcgtg gagttcctcg agoagaattc taacgaagtc 
Ike hpt dfav e£l eqn snev 

721 caggaggtgt hcgccaaggc atttgogtac tatatcgaac cccagcatcg cgatgtgctc 
qev fak afay yie pqh rdvl 

781 cagctgtacg ccccggaggc atttMCtac atggacaaat tcaatgaaca ggagattaat 
qly ape afny mdk fne qein 

841 ctgtctctgg aggaactgaa agaccagagg atgctctccc ggtatgaaaa gtgggaaaag 
Isl eel kdqr mla rye kwek 

901 atcaaacagc attaccagca ttggtccgac tccctgtcag aagaggggcg cggcctgttg 
ikq hyq hwad als eeg rgll 

961 aaaaagttgc agattcccat cgagcctaag aaagatgata taataoactc tctaagccag 
kkl qlp iepk kdd lih alaq 

1021 gaggagaagg aactcctgaa gcggatacaa atcgactcat ccgatttcct tagcacagaa 
eek ell krlq Ids adf late 
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. Pigfure 4B 

1081 gagaaggagt ttctaaaaaa acttcagata gatattagag attcactgag c^ggaagag 
eke flk klqi dir dsl seee 

1141 aaggagctgc tcaaccgaat tcaagtcgat agttcgaacc ccttgtcaga aaaagagaag 
kel Inr iqvd ssn pis ekek 

1201 gaattcctga aaaagttgaa gatcgacatc cagccgtacg atattaatca gcggctacaa 
efl kkl kldi qpy din grlq 

1261 gacaccggcg gtctgattga bagccccagc atcaaccttg acgtacggaa gcaatataag 
dtg gll dspa Inl dvr kqyk 

1321 cgcgacattc aaaatatcga cgccctatta catcaatcca taggctccac gotatacaat 
rdi qni dall hqs igs tlyn 

1381 aaaatctatc tatacgaaaa catgaatatt aacaatctca ccgctacact gggagcggao 
kiy lye nmni nnl tat Igad 

1441 ctggtcgata gtacagacaa cacaaagata aaoagaggta ttttcaacga attcaaaaag 
Ivd std ntki nrg ifn efkk 

1501 aactttaagfc attcgatcag cagtaactat atgattgttg acatcaatga aoggcc'cgca 
ixfk ysi sany miv din erpa 

1561 ttagaoaatg agaggttgaa gtggagaatt caaotgagto ctgatactag ggccggctab 
Idn erl kwrl qls pdt ragy 

1621 ctggagaacg ggaaactgat cttacagcga aacatcgggc tggagatoaa ggatgtgcag 
len gkl ilqr nig lei kdvq 

1681 attatcaagc agagcgaaaa agaatacatt cgcatcgacg ccaaggtggt gcctaagtca 
iik qse keyi rid akv vpka 

1741 aagatcgata ccaagatcca ggaagctcag cbqaacatta accaggagtg gaataaagct 
kid tki qeaq Ini nqe wnka 

1801 cttggtctgc caaaatacac caaacttatc acctttaatg tgcacaacag gtatgcctct 
Igl pky tkli tfn vhn ryas 

1861 aatatcgtcg agtcagcata cctgattctc aatgaatgga agaacaatat tcagtctgac 
niv esa ylil new knn iqsd 

1921 ctgatcaaga aggtcacgaa ttatctggtg gacsggaaatg gcagattcgt gttcaccgac 
lik kvt nylv dgn grf vftd 

1981 ataactttgc caaacattgc cgagcaatac actcatcagg atgaaattta cgagcaagtc 
itl pni aeqy thq dei yeqv 

2041 cactccaaag gtctgtatgt tccagagtca agatcgattc tgctccatgg tccatccaaa 
hak gly vpes rai llh gpsk 

2101 ggggttgagc ttcgaaacga ttctgaggga tttatcsgctg actttggagc cgctgtggat 
gve Irn daeg fia dfg aavd 
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Figure 4C 

2161 gactacgccg gatacctgtt ggataagaat cagtctgatc tcgtgacaaa tagoaaaaaa 
d y a gyl Idkn qad Ivt nskk 

2221 ttcatagata ttttcaagga ggaagggagt aacctgactt cctatggocg cacgaacgag 
fid ifk eegs nit eyg rtne 

2281 gctgaatttt ttgoggaagc ctttagactt atgcacagca ccgaccatgc tgaaaggttg 
ae£ fae afrl mha tdh aerl 

2341 aaggtgcaaa agaatgcccc taaaaccttc cagttcataa atgaccagat CGiagttcatc 
kvq kna pktf qfi ndq ikfi 

2401 atcaiactctt gaggatcc 
ins 
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Figure 5A 

1 gatatcgcca ccatggatgc aatgaagaga gggctctgct gtgtgctgct gctgtgtgga 
md amlcr glc cvl llcg 

61 gcagtcttog tttcgcccag cgcoggcggg catggggacg ttggcatgca tgtgaaagaa 
avf vsp sagg hgd vgm hvke 

121 aaggagaaaa acaaggacga aaacaagcgt aaagacgaag aacgtaataa aacacaggag 
kek nkd enkr kde ern ktge 

181 gaacacttaa aggagatcat gaagcaoata gtaaagattg aggtaaaagg cgaagaggct 
ehl kei rakhl vkl evk geea 

241 gtaaagaagg aggcagcaga aaaactgttg gagaaggtgc cttctgacgt cttagagatg 
vkk eaa ekll ekv pad vlem 

301 tataaggcca tcggoggtaa gatcbatatc gtggacggag acatcaataa acacab'atct 
yka ±gg klyi vdg dit khis 

361 ctcgaagctG tctccgagga caagaaaaag attaaagaca tctacgggaa ggatgccbta 
lea Ise dkkk. ikd iyg kdal 

421 ttgcacgagc actacgttta cgcaaaggag ggctatgagc ccgtgotcgt tattcagagt 
Ihe hyv yake gye pvl vlqs 

481 agtgaggact acgtogagaa taccgagaaa gctctgaatg tgbattaoga gabcggaaag 
sed yve ntek aln vyy eigk 

S41 attctgbccc gggacabcct gtccaaaabc aaccagccab accagaaabb cctbgabgbb 
lis rdl Iskl ngp ygk fldv 

601 cbbaacacaa bcaaaaacgc gbcagatagc gacgggcagg atctbcbgtb bacaaabcaa 
Inb Ikn asds dgg dll fbnq 

661 cbcaaggaac. accccactga bbtcagcgbg gagbbcctcg agcagaabbc baacgaagbc 
Ike hpb dfsv efl eqa enev 

721 caggaggbgt bcgccaaggc abbbgcgbac tabatcgaac cccagcabcg cgabgtgobc 
qev fak afay yie pqh rdvl 

781 cagctgtacg ccccggaggc abbtaacbac abggacaaab bcaabgaaca ggagabtaab 
qly ape afny mdk fne qeia 

841 cbgbctcbgg aggaacbgaa agaccagagg abgcbcbccc ggbabgaaaa gbgggaaaag 
Isl eel kdqr mis rye kwek 

901 abcaaacagc atbaccagca bbggbccgac bcccbgbcag aagaggggcg cggccbgbbg 
ikq hyq hwsd sis eeg rgll 

961 aaaaagbbgc agabbcccab cgagccbaag aaagabgaba baabacacbc tcbaagccag 
kkl qip iepk kdd iih slsg 

1021 gaggagaagg aacbccbgaa gcggabacaa atcgactcat ccgabbbccb bagcacagaa 
eek ell kriq ids sdf Isbe 
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1081 gagaaggagt ttctaaaaaa acttcagata 
eke flk klqi 

1141 aaggagctgc tcaaccgaat toaagtcgat 
kel Inr iqvd 

1201 gaattcctga aaaagttgaa gctcgacatc 
efl kkl kldi 

1261 gacaccggcg gtctgattga tagccccagc 
dtg gli dsps 

1321 cgcgacattc aaaatatcga cgcoctatta 
rdi qni dall 

1381 aaaatctatc tatacgaaaa catgaatatt 
kiy lye nmni 

1441 ctggtcgata gtacagacaa cacaaagata 
Ivd std ntki 

1501 aactttaagt attcgatcag cagtaactat 
nfk y.si ssny 

1561 ttagaeaatg agaggttgaa gtggagaatt 
Idn erl kwri 

1621 ctggagaacg ggaaactgat cttacagcga 
le n gkl ilqr 

1681 attatcaagc agagcgaaaa agaatacatb 
iik qse key! 
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g^tattagag attcactgag ogaggaagag 
dir dsl seee 

agttcgaacc ccttgtcaga aaaagagaag 
sen pis ekek 

cagccgtacg atattaatca goggctacaa 
qpy din qrlq 

atcaaccttg acgtacggaa gcaatataag 
inl dvr kqyk 

catcaatcca taggotccac gqtatacaat 
hqs igfl tlyn 

aa c aatctca ccgctacaat gggagcggac 
nnl tat Igad 

aacagaggta ttttcaacga attcaaaaag 
nrg ifn e£kk 

atgattgttg acatcaatga acggcccsgca 
miv din erpa 

caactgagtc cbgatactag ggccggctat 
qls pdt ragy 

aaoatcgggc tggagatcaa ggatgtgcsag 
nig lei kdvq 

cgcatcgacg ccaaggtggt gtagggatcc 
rid a kv v- 
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Figure S 

1 gatatcgcca ccatggatgc aatgauigaga 
rod a m k r 

61 gcagtcttcg tbtcgcccag cgccggcggg 
av£ vsp sagg 

121 aaggagaaaa acaaggacga aaacaagcgt 
kek nkd enkr 

ifll gaaoacttaa aggagatcat gaagcaoata 
ehl kei rokhi 

241 gtaaagciagg aggcagcaga aaaactgttg 
vkk eaa ekll 

301 tataaggcca tcggcggrtaa gatctatabc 
yka igg kiyi 

361 ctcgaagctc tctccgagga caagaaaaag 
lea Ise dkkk 

42l ttgcacgagc actacgttta ogoaaaggag 
Ihe hyv yake 

481 agtgaggact acgtcgagaa taccgagaaa 
-sed yve ntek 

541 attctgtccc gggacatcct gtccaaaatc 
lis rdi Iski 

601 cttaacacaa tcaaaaacgc gtcagatagc 
lot ikn asde 

661 ctcaaggaac accccactga tttcagcgtg 
Ike hpt d£sv 

721 caggaggtgt tcgccaaggc attttgagga 
qev fak a£- 



gggctctgct gtgtgctgct gctgtgtgga 
glc cvl lleg 

catggggacg ttggcatgca tgtgaaagaa 
hgd vgm hvke 

aaagacgaag aaogtaataa aacacaggag 
kde ern ktqe 

gtaaagabhg aggtaaaagg cgaagaggct 
vki evk geea 

gagaaggtgc cttctgacgt cttagagatg 
ekv psd vlem 

gtggacggag acatcactaa acacabatct 
vdg dit khis 

attaaagaca tctacgggaa ggatgcctta 
ikd iyg kdal 

ggctatgagc ccgtgotcgt tattcagagt 
gye pvl viqs 

gctctgaatg tgtabbacga gatcggaaag 
ala vyy eigk 

aaccagccat aocagaaatt ccttgatgtt 
^*IP y<I^ fldv 

gacgggcagg atcttctgtt tacaaatcaa 
dgq dlX ftnq 

gagttcctcg agcagaattc taacgaagtc 
e£l eqn sne'v 

tec 
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Figure 7A 

1 gatatcgcca ccatggatgc aatgaagaga gggokctgct gbgtgctgct gctgtgtgga 
md arakr gle cvl llcg 

€1 gcagtcttcg tttcgcocag oagogctggg ccaactgtgc ccgacagaga caatgatgga 
av f vsp ssag ptv pdr dndg 

121 atccctgata gtctagaggt tgagggatac acggtagatg tcaagaacaa aaggaotttt 
tpd Ble vegy tvd vkn krtf 

181 ctetcgeett ggatctcaaa tatocatgag aagaaggggc ttaccaagta caagtcctcc 
lop wis nihe kkg Itk ykss 

241 ocogagaagt ggtctaccgc ttccgatcca tatagcgatt tcgagaaggt caoaggcogg 
pek wet asdp ysd fek vtgr 

301 atcgataaac aggtgtctcc agaggctaga caccccctgg tagcagocta cccgattgta 
idk qvs pear hpl vaa ypiv 

361 cacgtggaca tggagaacat cattctaagc aaaaacgagg accagtccac acaaaacact 
hvd men ills kne dgs tqnt 

421 gactccgaga cocgcaccat atctaaacag accagtactt caaggacoca cacctctgaa 
dae trt iekq tat srt htse 

481 gtgcacggca atgcggaagt ccatgcatcg tttttcgata ttggtggctc cgtgtcagcc 
vhg nae vhas ££d igg svsa 

541 ggctttagca atagccagtc Ptcgacggtt gccattgacc actcactgtc attagcaggt 
g£8 nag a a t v aid hsl slag 

601 gagaggactt gggctgaaac tatgggtctg aataccgccg atacggcccg gctcaacgca 
ert wae tmgl tita dta rlna 

661 aatattcggt acgtcaacac agggactgct cctatatata acgtgctgcc tacgacaagt 
nlr yvn tgta piy nvl ptte 

721 cttgtcctgg gcaaacagca gaocctogca accattaagg caaaggaaaa tcagctgagc 
Ivl gkq qtla tik ake nqls 

781 cagatcctcg cccctaacaa ctattatcca tccaaaaatt tagcccccat agccctgaac 
qil apn xxy yp skn lap lain 

841 gcccaggacg acttttcctc tacccccata actatgaatt acaatcagtt cctggagctg 
aqd dfs stpi tmn yaq flel 

901 gaaaagacga agcagctgag actagacacc gatcaggtgt atggaaacat agcgacatat 
ekt kql rldt dqv ygn iaty 

961 aactttgaga acggccgcgt gcgcgtcgac actgggtcac agtggtctga agttctgccg 
nfe ngr vrvd tga qwe evlp 

1021 caaattcaag agacaaccgc cagaattato tttaatggga aggacttgaa ccttgtcgaa 
qiq etb arii fng kdl nlve 
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Plgure 7B 

lOBl cgtagaattg ccgccgtgea goccagtgat 
rri aav qpsd 

1141 ctgaaagagg ctctgaagat tgccttcgga 
Ike alk iafg 

1201 cagggg€uiag acabcacaga gtttgatttc 
qgk dlt efdf 

1261 aaaaatcagt tggcagagct gcaggccacc 
knq lae Iqat 

1321 dttaacgcca agatgaacat attgattaga 
Ina kmn llir 

1381 atagccgtag gcgctgatga gtctgbcgtc 

iav gad eavv 

c 

1441 agtactgaag ggotgttact taatatcgac 
8te gll Inld 

1501 atcgtggaga tcgaggatac cgagggcctg 
ive ied tegl 

15€1 ctgoagattt ccagcttacg acaggaoggt 
lq± 8Sl rqdg 

1621 gacaagctac ccctgbacat ttccaaccca 
dkl ply isnp 

1681 aaggagaaca caatcatcca gcceiagcgag 
ken tii qpse 

1741 aagatcctta tatttagtaa aaaaggctac 
kil ifs kkgy 



ccacfccgaga pgactaaacc ggatatgaca 
pie ttk pdmt 

ttcaacgaac ctaatggcaa tttgcagtat 
fne png nlqy 

aatttcgatc agcagacttc ccaaaatatc 
nfd qqt sqnl 

aatatcbaoa oggttcbcga taaaabcaaa 
niy t vl dkik 

gacaaacgct bccacbacga ccgcaacaab 
dkr fhy drnn 

aaggaggctc abagggaagb batcoagagc 
kea hre vlqs 

aaggacatbc ggaagatccb gbccgggbab 
kdi rki legy 

aaggaagtca ttaacgaccg ctatgatatg 
k ev ind rydm 

aagaoattba bbgactbtaa aaagtataac 
fcbf id,f kkyn 

aattacaaag tbaabgbgta tgcbgbaacc 
nyk vnv yavb 

aacggcgata ccagcacaaa tggaatcaaa 
ngd bsb nglk 

gagabcggbb gaggabcc 
e i g 
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Figure 8A 

1 gatatcgcca ocatggatgc aatgaagaga gggctctgot gtgtgctgct gctgtgtgga 
md amkr glc cvl llcg 

'61 gcagtcttcg tttcgcocag cgocggcggg aatggggaog ttggcatgca tgtgaaagaa 
avf vsp sagg hgd vgm hvke 

121 aaggagaaaa acaaggacga aaacaagcgt aaagacgaag aacgtcagaa aacacaggag 
kek nkd enkr kde erq ktqe 

181 gaacacttaa aggagatcat gaagcacata gtaaagattg aggtaaaagg cgaagaggct 
ehl kel mkhi vki evk geea 

241 gtaaagaagg aggcagoaga aaaaotgttg gagaaggtgc cttctgacgt ottagagatg 
vkk eaa ekll ekv pad vlem 

301 tataaggcca tcggcggtaa gatctatatc gtggacggag acatcactaa acacatatct 
yka Igg kiyi vdg dit khla 

361 ctcgaagctc tctccgagga caageiaaaag attaaagaca tctacgggaa ggatgcctta 
laa Ise dkkk ikd lyg kdal 

4^1 ttgcacgagc actacgttta cgcaaeiggag ggctatgagc ccgtgctcgt tattcagagt 
Ihe hyv yake gya pvl viqs 

481 agtgaggact acgtcgagaa taccgagaaa gctctgaatg tgtattacga gatcggaaag 
eed yve ntek aln vyy eigk 

541 attctgtccc gggacatcct gtccaaaatc aaccagccat accagaaatt ccttgatgtt 
ils rdi iBkl nqp yqk fldv 

601 cttaacacaa tcaaacaggc gtcagatagc gacgggcagg abcttotgtt tacaaatcaa 
Int ikq asds dgq dll ftnq 

661 ctceiaggaac accccactga tttcagcgtg gagttcctcg agcagaatkc taacgaagtc 
Ike hpt dfflv efl eqn snev 

721 caggaggtgt tcgccaaggc atttgcgtac tatatcgaac cccagcatcg cgatgtgctc 
qev fak afay yie pqh rdvl 

781 cagctgtacg ccccggaggc atttaactac atggacaaat tcaatgaaca ggagattcag 
qly ape afny mdk fae qeiq 

841 ctgtctctgg aggaactgaa agaccagagg atgctctccc ggtatgaaaa gtgggaaaag 
Isl eel kdqr mis rye kwek 

901 atcaaacagc attaccagca ttggtccgac tccctgtcag aagaggggcg cggcctgttg 
ikq hyq hwsd sis eeg rgll 

961 aaaaagttgc agattcccat cgagcctaag aaagatgata taabacactc tctaagccag 
kkl qlp l epk kdd iih slsq 

1021 gaggagaagg aactcctgaa gcggatacaa atcgactcat ccgatttcct tagcacagaa 
eek ell krlq ids adf late 
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Figure 8B 

1081 gagaaggagt ttctaaaaaa acttcagata 
eke flic klqi 

1141 aaggagctgc tcaacogaat tcaagtcgab 
kel Inr iqvd 

1201 gaattcctga aaaagttgaa gctcgacatc 
e£l kkl kldl 

1261 gacaccggcg gtctgattga tagccccagc 
dtg gll deps 

1321 cgcgacattc aaaatatcga cgccctatta 
rdl qni dall 

1381 aaaatctatc tatacgaaaa catgaatatt 
kiy lye nmni 

1441 otggtcgata gtacagacaa cacaaagata 
Ivd 8td ntki 

1501 aacbttaagt attcgatcag cagtaaotat 
n£k ysi ssny 

1561 ttagacaatg agaggttgaa gtggagaatt 
Idn erl kwrl 

1621 ctggagaacg ggaiaactgat cttacagcga 
len. gkl ilqr 

1681 attatcaagc agagcgaaaa agaatacatt 
ilk qse key! 

1741 aagatcgata ccaagatcca ggaagctcag 
kid tki qeaq 

1801 cttggtctgc caaaatacac caaacttatc 
Igl pky tkii 

1861 aatatcgtcg agtcagcata cctgattctc 
niv esa ylil 

1921 ctgatcaaga aggtcacgaa ttatctggtg 
lik kvt nylv 

1981 ataiactttgc caaacattgc cgagcsiatac 
itl pni aeqy 

2041 cactccaaag gtctgtatgt tccagagtca 
hsk gLy vpee 

2101 ggggttgagc ttcgacagga ttctgaggga 
gve Irq dseg 
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gatattagag attcactgag cgaggaagag 
dir dsl seee 

agttogaacc ccttgtcaga aaaagagaag 
Ban pis ekek 

cagccgtacg atattaatoa gcggotacaa 
qpy din qrlq 

atcaaccttg acgtacggaa gcaatataag 
inl dvr kqyk 

catcaatcca taggctccac gctatacaat: 
hqs igs tlyn 

aaccagctca ccgctacact gggagcggac 
nql tat Igad 

aacagaggta ttttcaacga attcaaaaag 
nrg ifn. efkk 

atgattgttg acatcaatga acggcccgca 
miv din erpa 

caactgagtc ctgatactag ggccggctat 
q. Is pdt ragy 

aacatogggc tggagatcaa ggatgtgcag 
nig lei kdvq 

cgcatcgacg ccaaggtggt gcctaagtca 
rid akv vpks 

ctcaacatta accaggagtg gaataaagct 
Ini nqe wnka 

acctttaatg tgcacaacag gtatgccbct 
t£n vhn ryas 

aatgaatgga agaacaatat tcagtcbgac 
new knn iqsd 

gacggaaatg gcagattcgt gbtcaccgac 
dgn grf vfbd 

actcatcagg atgaaattba cgagcaagtc 
thq dei yeqv 

agabcgatbc tgctccatgg bccabccaaa 
rsi llh gpsk 

bbbatcgctg acttbggagc cgctgtggat 
£ia dfg aavd 
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Figure 8C 

2161 gactacgccg gataectgtt ggataagoag cagtotgatc tcgtgacaaa tagcaaaaaa 
dya gyl Idkq qsd Ivt nakk 

2221 ttcatagata ttttcaagga ggaagggagt cagctgactt cctatggocg cacgaaogag 
£id i£k eegs git ayg rtne 

2281 gotgaatttt: ttgcggaagc ctttagactt atgcacagca ccgaccatgc tgaaaggttg 
ae£ fae afrl mhs tdh aerl 

2341 aaggtgcaaa agaatgcocc taaaaccttc cagttcataa atgaccagat caagttcatc 
kvq kna pktf q£l ndq ik£i 

2401 atcaactctt gaggatcc 
ins 
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TPA-Human PA 82 agcgctgggccaactgtgcccgacagagacaatgatggaatccctgatagtctagaggttgagggataca 

B.a,- PA 595 ,.t a..t..g..t..a...c.t teat a. .a t- 

TPA-amnan PA 152 cggtagatgtcaagaacaaaaggacttttctctcgccttggatctcaaatatccatgagaagaaggggct 

B.a.- PA 665 ....t a..t a t.,a..a t..t t a a. .at. 

•TPA-Human PA 222 taccaagtacaagtcctcccccgagaagtggtctaccgcttccgatccatatagcgatttcgagaaggtc 

B.a.- PA 735 a a. . t. .a. .a. . t. .t. *a. .a. . .age, .g t g.«c..t a t 

TPA-Humon PA 292 acaggccggatcgata&aaatgtgtctccagaggctagacaccccctggtagcagcctacccgattgtac 

B,a-- PA 805 a t g a. .a a t..g t..t 

TPA-Huxnan PA 362 acgtggacatggagaacatcattctaagcaaaaacgaggaccagtccacacaaaacactgactccgagac 

B.a-- PA 875 .t..a,.t t..t ctca t t. .a g..t tagt..a-. 

TPA-Huxnan PA 432 ccgcaccatatctaaaaacaccagtacttcaaggacccacacctctgaagtgcacggcaatgcggaagtc 

B,a.- PA 945 ga«a..a.*.ag t. .ttc. . -aagt a..t.,tag a,.t..a a g 

TPA-Human PA 502 catgcatcgrtttttcgatattggtggctccgtgtcagccggctttagcaatagcaactcctcgacggttg 

B.a.-- PA 1015 g c-.t gagt. . a. . t. .a. .a t . . .teg. . t , -aagt c. 

TPA-Huzaan PA 572 ccattgaccactcactgtcattagcaggtgagaggacttgggctgaaactatgggtctgaataccgccga 

B.a-- PA 1085 .a t..t a. .tc g,.a.,a a t.a t.. 

TPA*-Human PA 642 tacggcccggctcaacgcaaatattcggtacgtcaacacagggactgctcctatatataacgtgctgcct 

B.a.- PA 11S5 . . ,a. .aa.at.a.,t. .c a.a. . t . .a. . t. . t g a.-c..c t.a.,a 

TPA-Hmoan PA 712 acgacaagtcttgtcctgggcaaaaatcagaccctcgcaaccattaaggcaaaggaaaatcagctgagce 

H.a,- PA 1225 ttcgt.a, -gt.a. .a a. .a g,,a a..t c..at.a..t. 

TPA-Huxnan PA 782 agatcctcgccectaaeaactattatccatccaaaaatttagcccccatagccctgaacgcecaggacga 

B.a.- PA 1295 .a-.a.^t..^ t,.t t..t .g. ,g. -a. .c. .at.a. .t. .a. .a 

TPA-Huiaan PA 852 cttttcctctacccccataaetatgaattaeaatcagttcctggagctggaaaagacgaagcagctgaga 

B,a.- PA 1365 t..cagt t..a.,t..a a, .t. • t. . .t .a. , . . .a a. .at. a... 

TPA-Hunan PA 922 ctagacaccgatcaggtgtatggaaacatagcgacatataactttgagaacggccgcgtgcgcgtcgaca 

B.a,- PA 1435 t....t..g a. .a g,.t a c.t a. , t. .aa.a. , .a.g. .g. .t. 

TPA-Huroaa PA 992 ctgggtcaaactggtctgaagttctgccgcaaattcaagagacaaccgccagaattatctttaa.tgggaa 

B.a,- PA 1505 .a. .eg ag gt.a a t. .act. ,c. . t a.. 

TPA-Buman PA 1062 ggacttgaaccttgtcgaaegtagaattgccgccgtgaaceccagtgatccactcgagaegactaaaccg 

B.a.- PA 1575 a..t..a..t..g,,a...a.gc.g..a..g,.g-,t».t.,t t.a..a 

TPA-Human PA 1132 gatatgacactgaaagaggctctgaagattgccttcggattcaacgaacetaatggcaatttgcagtatc 

B.a.- PA 1645 t.a a . . c. . t . .a. . a. . a- , t t g a. . e. .a. .a. . . . 

TPA-Human PA 1202 aggggaaagacatcacagagtttgatttcaatttcgatcagcagacttcccaaaatatcaaaaatcagtt 

B.a.- PA 17X5 .a a..c..a t a..a,.a..t g 

TPA-Human PA 1272 ggcagagctgaatgccaccaatatctacacggttctcgataaaatcaaacttaacgccaagatgaacata 

B.a.- PA 1785 a. .g. .at.a. .c. .a. . t. .c. .a. .t. , t, .at . a , t,a.,t..a..a t..t 

TPA-fluioaa PA 1342 ttgattcgagacaaaegcttccaetacgacegcaacaatatagccgtaggcgctgatgagtctgtcgtca 

B.a.- PA 1855 ..a..aa t t . .t. .t. . t. . ta.a. .t. .c a..t..g..g a..a..t. 

TPA-Human PA 1412 aggaggctcatagggaagttatcaacagcagtactgaagggctgttacttaatategaeaaggacattcg 

B.a,- PA 1925 a a. .t. .ttcgtca. .a. .g. .at«a. .gt.a t..t t. .aa. 

TPA-Human PA 1482 gaagatcctgtccgggtatatcgtggagatcgaggataccgagggcctgaaggaagtcattaacgacegc 

B.a.- PA 1995 a. .a. .at. a. .a. .t t. .a. .a. .t. .a t. -a. .g. .t. ,a t. .a. .t- . .a.a 

TPA-Human PA 1552 tatgatatgctgaacatttccagcttacgacaggacggtaagacatttattgaetttaaaaagtataacg 

B.a.- PA 2065 t t t..t g. . a. - 1, .a. .a a.,t a t. 
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TPA-Human PA 1622 acaagctacecctgtacatttccaacccaaattacaaagttaatgtgtatgctgtaaccaaggagaacac 



B.a. 



PA 



2135 .t..at. . . .gt.a. .aagt..t 



.t..g. .a. 



.t..t. 



TPA-Humaa PA 1692 aatcatcaatccaagcgagaacggcgataccagcacaaatggaatcaaaaagatccttatatttagtaaa 



B.a. 



PA 



2205 t. .t. .t t. .t t. .g t.,t,-c. .eg g. ,d.,tt. 



.tc- 



TFA-Human PA 1762 aaaggctacgagatcggttga 



B.a. 



PA 



2275 



.a. .a.a» 
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Figure lOA 

TPA-humaa fu 82 gaagtgaagcaagaaaatcgacttctgaacgagagcgaaagttcatcacagggtcttctc 

B.a-- PA 88 . 1 . . .t, ,a- .g- .g. .c. .gt.at.a. -t. .atca. . . tcaagt. .c gt.a.,a 

TFA-human fu 142 ggatactacttcagtgacttgaatttccaagcaccaatggtggtgactagtagcaccacc 

B.a-- PA 148 t..t t t C t..ctc.tct..t,,a 

T PA- human £u 202 ggcgatttgagcattcccagctctgagttggagaacattcccagcgaaaatcagtacttc 

B.a-- PA 208 . .g atct t..t a..a..t atcg c,.a-,t..t 

TFA-human fu 262 cagtctgctatctggtccggattcattaaggttaaaaagtccgacgaatatacatttgct 

B.a.- PA 268 ..a t a t..c..a g...agt..t 

TPA-human fu 322 acctcggcggataaccatgtgacaatgtgggtggacgaccaggaagtgatcaacaaggct 

B.a.- PA 328 .,t,.c..t t a a..t a t..t..a... 

TPA-human fu 382 tcaaactctaataaaatccggctcgagaaggggaggctctaccagatcaaaattcagtac 

B.a.- PA 388 ..t..t.....c a.at.a. .a. ,a. .a. . at. a. . t . .a, ,a a..t 

TPA-human fu 442 cagcgggaaaaccctacagaaaaaggactcgatttcaagctgtactggacagatagccaa 

B.a.- PA 448 ..a.^a t t t.g t c.-tct... 

TPA-human fu 502 aacaagaaagaagttatcagctcagacaatctgcagttacccgagctcaagcagaagagt 

B.a.- PA 508 ..t..a g, .ttctagt . .t. ,ct .a. -a. .g. .a. .at.o . .a. .a. .ate. 



TPA-humaa fu 
B.a.- PA 

TPA-human fu 
B.a.- PA 



562 tctaa tacaagcgctgggccaactgtgcccgacagagacaat 

566 - .g. .ctcaagaaaaaagcgaag. . . . . . t a. .t. .g. .t. .a. . .c. t 

604 gatggaatccctgatagtctagaggttgagggatacacggtagatgtcaagaacaaaagg 
628 teat a. .a t t a..t a 



TPA-human fu 664 acttttctctcgccttggatctcaaatatccatgagaagaaggggcttaccaagtacaag 

B.a.- PA 688 t..a..a l>..t t a a. .at. a a..t..a 

TPA-human fu 724 tccteccccgagaagtggtctaccgcttccgatccatatagcgatttcgagaaggtcaca 

B.a.- PA 748 . ,a..t..t. .a. .a- . -agc.g t g..c..t a t. . . 

TPA-huxoan fu 784 ggccggatcgataaaaatgtgtctccagaggctagacaccccctggtagcagcctacceg 

B.a.- PA 808 ..a t g a,*a a t..g t..t... 

TPA-human fu 844 attgtacacgtggacatggagaaeatcattctaagcaaaaaegaggaccagtceacacaa 

B.a.- PA 868 t..a..t t..t ctca. 1 t..a g 

TPA-humaa fu 904 aacactgactccgagacccgcaccatatctaaaaacaccagtacttcaaggacecacacc 

B.a.- PA 928 ..t tagt. .a. .ga.a. .a. * .ag t. .ttc. . .aagt a. .t..t 



TPA-human fu 964 tctgaagtgcacggcaatgcggaagtccatgcatcgtttttcgatattggtggctccgtg 

B.a,- PA 988 ag a-.t..a a g g c.t gagt..a 

TPA-human fu 1024 tcagccggctttagcaatagcaactcctcgacggttgccattgaccactcactgtcatta 

B.a.- PA 1048 -.t..a..a t . . . teg. . t. .aagt c.a t..t a..tc.. 



TPA-human fu 1084 gcaggtgagaggacttgggctgaaactatgggtctgaataccgccgatacggcccggctc 

B.a.- PA 1108 g..a..a.... a t.a t a..aa.at,a 

TPA-human fu 1144 aacgcaaatattcggtacgtcaacacagggaetgctcctatatataacgtgctgcctacg 

B.a.- PA 1168 ..t..c a.a, ,t, .a. .t. .t g a..c,.c t.a.,a..- 

TPA^huraan fu 1204 acaagtcttgtcctgggcaaaaatcagaccctcgcaaccattaaggcaaaggaaaatcag 

B.a>- PA 1228 . .ttcgt.a. .gt.a. .a a. .a g..a a..t c.a 

TPA-human fu 1264 ctgagccagatcctcgcccctaacaactattatccatccaaaaatttagcccccatagcc 

B.a.- PA 1288 t .a. . t. .a, .a. . t. .a t..t t..t c. .g. ,g. .a, .c. .a 

TPA-human €u 1324 ctgaacgcccaggacgacttttcctctacccccataactatgaattacaatcagttcctg 

B.a.- PA 1348 t. a . . t . . a . .a. . . . . t . .cagt t..a..t..a a-.t..t 



21/41 



wo 2004/024067 



PCTAJS2003/028199 



Figure lOB 



TPA-human £u 1384 gagctggaaaagacgaagcagctgagactagacaccgatcaggtgtatggaaacatagcg 

B.a,- PA 140B — t.a a a..at4a.,,t 1. .g a. .a g..t a 

TPA^buman fu 1444 acatataactttgagaacggccgcgtgcgcgtcgacactgggtcaaactggtctgaagtt 

B.a.- PA 1468 c.t a. .t. .aa.a. . *a.g« .g* .t* *a. .c. .g ag g 

TPA-human fu 1504 ctgccgcaaattcaagagacaaccgccagaattatctttaatgggaaggacttgaacctt 

B.a.- PA 1528 t.a a t. . act. , c. . t a. -a. . t. .a. - 1. .g 

TPA-human fu 1564 gtcgaacgtagaattgccgccgtgaaccccagtgatccactcgagacgactaaaccggat 

B.a«- PA 1588 . .a. . .a.gcg, .a, .g. .g. .t. *t, •t t.a. .a • 

TPA-human fu 1624 atgacactgaaagaggctctgaagattgccttcggattcaacgaacctaatggcaatttg 

B.a.- PA 1648 t.a a. .c. . t. .a. .a. .a. . t t g: a..c..a 

TPA-huzDaa fu 1684 cagtatcaggggaaagacatcacagagtttgatttcaatttcgatcagcagacttcccaa 

D,a.- PA 1708 ..a a ..».a..c..a t a. ,a. .a. . t. . . 

TPA-human fu 1744 aatatcaaaaatcagttggcagagctgaatgccaccaatatctacacggttctcgataaa 

B.a.- PA 1766 g a. .g. .at.a. .c. .a. . t. .c. .a. .t. .t. .at«a 

TPA-huznaii fu 1804 atcaaacttaacgccaagatgaacatattgattcgagacaaacgcttccactacgaccgc 

B.a.- PA 1828 t .a* . t. .a. .a. * . . .t. .t . .a. .aa 1 t. .t. .t. .t. . ta.a 

TPA*human fu 1864 aacaatatagccgtaggcgctgatgagtctgtcgtcaaggaggctcatagggaagttatc 

B.a.- PA 1888 ..t..c a..t..g..g a..a..t a a.«t 

TPA-bmoan fu 1924 aacagcagtactgaagggctgttacttaatatcgacaaggacattcggaagatcctgtcc 

B.a.- PA 1948 . .ttcgtca. .a. .g. .at.a. .gt.a t..t t. .aa.a. .a. «at.a. .a 

TPA-huioan fu 1984 gggtatatcgtggagatcgaggataccgagggcctgaaggaagtcattaacgaccgctat 

B.a.~ PA 2008 ..t t. .a. .a. .t. .a t. .a, .g* *t. .a t. .a. .t. .*a.a. . . 

TPA-huzaan fu 2044 gatatgctgaacatttccagcttacgacaggacggtaagacatttattgactttaaaaag 

B.a,- PA 2068 t 1 t..t g. .a, ,t, ,a, .a a..t ,.a 

TPA-human fu 2104 tataacgacaagctacccctgtacatttccaacccaaattacaaagttaatgtgtatgct 

B.a.*- PA 2128 t . . t . . at. . . .gt.a. . t. .aagt . . t. .c t..g..a a 

TPA-human fu 2X64 gtaaccaaggagaacacaatcatcaatccaagcgagaacggcgataccagcacaaatgga 

B.a.- PA 2188 .,t.,t.,a..a t..t.'.t t . . t- . . , . t . .g t . . t, .C, , c, .g 

TPA-human fu 2224 atcaaaaagatccttatatttagtaaaaaaggctacgagatcggttgaggatcc 

B.a.- PA 2248 g. .a. . tt .a. .c. . . tc t a. .a. a. — 
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Figure llA 

TFA-HumaiL LF 82 gccggcgggcatggggacgttggcatgcatgtgaaagaaaaggagaaaaacaaggacgaa 

B.a.- LF 100 -.g t t..t..a-.t c-.a g-.a t..a..t..g 

TPA-Human LF 142 aacaagcgtaaagacgaagaacgtaataaaacacaggaggaacacttaaaggagatcatg 

B.a.^ LF 160 ..t.-.a.a t a a*.g..t a 

TPA-Kuman LF 202 aagcacatagtaaagattgaggtaaaaggcgaagaggctgtaaagaaggaggcagcagaa 

B.a.- LF 220 ..a t a. .a. .a g..g..a t..a..a 

TPA-Uuman LF 262 aaactgttggagaaggtgccttctgacgtcttagagatgtataaggccatcggcggtaag 

B.a** LF 260 ..g,*ac.t a. .a. .a t..t a, .a* . t. ,a. .a. . . 

TFA-Humaii LF 322 atctatatcgtggacggagacatcactaaacacatatctctcgaagctctctccgaggac 

B.a.- LF 340 ,.a t t. . t . .t. .t, ,a t t.a at,a. .t. .a. .t 

TPA-Ruioan LF 382 aagaaaaagattaaagacatctacgggaaggatgccttattgcacgagcactacgtttac 

B.a.- LF 400 a. .a t. .t a t a. . t . .a. . t. . t . ,a. ,t 

TPA-Human LF 442 gcaaaggagggctatgagcccgtgctcgttattcagagtagtgaggactacgtcgagaat 

B.a.- LF 460 a. .a. .a a a. .t. .a. .c. .atc.tcg. .a. . t. . t. .a, .a. . . 

TPA-Human LF 502 accgagaaagctctgaatgtgtattacgagatcggaaagattctgtcccgggacatcctg 

B.a.- LF 520 .,t..a..g..a c. .t t*.a..a,.t at.a. .aa. . . .t. .tt.a 

TPA-Human LF 562 tccaaaatcaaccagccataccagaaattccttgatgttcttaacacaatcaaaaaogcg 

B.a.- LF 580 agt t..t..a t tt.a — .at.a. . t. .c. .t t..a 

TPA-Human LF 622 tcagatagcgacgggcaggatcttctgtttacaaatcaactcaaggaacaccccactgat 

B.a.- LF 640 . . t. . .tea. . t. .a. .a t.a t g..t t a..c 

TPA-Human LF 6B2 ttcagcgtggagttcctcgagcagaattctaacgaagtccaggaggtgttcgccaaggca 

B.a.- LF 700 . .ttct. ..a. . a. . . t.g. .a. .a. . • age. . t . .g. .a. .a. .a. .a. .t. *9. .a. .t 

TPA-Human LF 742 tttgcgtactatatcgaaccccagcatcgcgatgtgctccagctgtacgccccggaggca 

B.a.- LF 760 a..t g. .a t tt,a t..t..a a. .t 

TPA-Human LF 802 tttaactacatggacaaattcaatgaacaggagattaatctgtctctggaggaactgaaa 

B.a.- LF 820 t t t.,c a. .a. .a a..ct....a t... 

TPA-Human LF 8 62 gaccagac^gatgctctcccggtatgaaaagtgggaaaagatcaaacagcattaccagcat 

B.a.-- LF 980 ..t.,ac g..aa.a a a c,.t..a..c 

TPA-Kuraan LF 922 tggtccgactccctgtcagaagaggggcgcggcctgttgaaaaagttgcagattcccatc 

B.a.- LF 940 ag. . . t. . tt.a. .t a. .aa.a. .a. .t. .a c t..t 

TPA-Human LF 982 gagcctaagaaagatgatataatacactctctaagccaggaggagaaggaactcctgaag 

B.a.- LF 1000 a c t. .t. . .t. . tct. .a. .a. .a. .a. .g. .t. .a. .a 

TPA-Human LF 1042 cggatacaaatcgactcatccgatttccttagcacagaagagaaggagtttctaaaaaaa 

B.a.- LF 1060 a. a t..tagtagt tt.atct. .t. .g. .a. .a t g 

TPA-Human LF 1102 cttcagatagatattagagattcactgagcgaggaagagaaggagctgctcaaccgaatt 

B.a.- LF 1120 .•a..a..t c.t tt.atct..a a. .a tt.a..ta. . « .a 

•TPA'-Kuman LF 1162 caagtcgatagttcgaaccccttgtcagaaaaagagaaggaattcctgaaaaagttgaag 

B.a,- LF 1180 ..g..g agt. . t. . t. .a. . t a. .a. .g. . tt. a c,...a 

TPA-Human LF 1222 ctcgacatccagccgtacgatattaatcagcggctacaagacaccggcggtctgattgat 

B.a.- LF 1240 . . t. .t. . t. .a. .a. .t aa. .t.g. . , . . t . .a. .a. .gt.a 

TPA-Human LF 128 2 agccccagcatcaaccttgacgtacggaagcaatataagcgcgacattcaaaatatcgac 

B.a.- LF 1300 . .t. .gtca, .t. ,t .t...a.a g aa.g.,t t..t 



TPA-Human LF 1342^ gccctattacatcaatccataggctccacgctatacaataaaatctatctatacgaaaac 
B.a.- LF 1360 ..tt t. .aagt. .ct.g t-..t.g..t t 
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Figure IIB 

TPA-Humah LP X402 atgaatattaacaatctcaccgctacactgggagcggacctggtcgatagtacagacaac 

B.a.- LF 1420 c- -t..c..t.,a.»a,.c*.a. .t tt .a. . t. . .tec, . t. .t. .t 

!rPA-Human hF 1462 acaaagataaacagaggtattttcaacgaattcaaaaagaactttaagtattcgatcagc 

B,a,- LF 1480 ..t*.a..t..t t a, • t . . c. . a*. . .agt. . ttct 

TPA-Human tF 1522 agtaactatatgattgttgacatcaatgaacggcccgcattagacaatgagaggttgaag 

B.a.- LF 1540 t..a a....t t c.t a 

TFA-Huioan LF 1582 tggagaattcaactgagtcctgatactagggccggctatctggagaacgggaaactgatc 

B«a«- LF 1600 c. . .t.atca. .a c.a. «a. .a* . .t.a. .a« .t. .a. .g* . t. .a 

TFA-Huxnan LF 1642 ttacagcgaaacatcgggctggagatcaaggatgtgcagattatcaagcagagcgaaaaa 

B.a.- LF 1660 aa t a.. a a..a..a..t ate 

TPA-Huanan LF 1702 gaatacattcgcatcgacgccaaggtggtgcctaagtcaaagatcgataccaagatccag 

B.a.- LF 1720 t^aa.g. . t. .t . .g, ,a. ,a a. . .agt. .a. .a a..a..t..a 

TPA^Ruman LF 1762 gaagctcagctcaacattaaccaggagtggaataaagctcttggtctgccaaaatacacc 

B.a.- LF 1780 a, . .t.a..t. .a. .t a at.a..gt.a t..a 

TPA-Human LP 1B22 aaacttatcacctttaatgtgcacaacaggtatgcctctaatatcgtegagtcagcatac 

B.a.- LF 1840 ..g t..a..c..c t,.t,.a a..c t. .a. ,aagt. . t, . t 

TPA-Huanan LF 1862 ctgattctcaatgaatggaagaacaatattcagtctgacctgatcaagaaggtcacgaat 

B,a,- LF 1900 t.a..at.g a. It aag. . . t. .t. .a. .a a. .a... 

!?PA-Huinan LF 1942 tatctggtggacggaaatggcagattcgtgttcaccgacataactttgecaaacattgcc 

B.a,- LF 1960 , .ct,a. . t . . t. .t a t.,t..t t ► ,t . , .c.c. . t. .t, .a. .t 

TPA-Buman LF 2002 gagcaatacactcatcaggatgaaatttacgagcaagtccactccaaaggtctgtatgtt 

B.a.- LF 2020 ..a t. .a a g..a..t t..t..a gt,a 

TPA-Hxainan LF 2062 ccagagtcaagatcgattctgctccatggtccatccaaaggggttgagcttcgaaacgat 

B.a.- LF 2080 a. -cc. t. . t. ,at,a a..t..a t . . a. -at.aa.g. . t. , . 

TPA-Hisniati LF 2122 tctgagggatttatcgctgactttggagccgctgtggatgactacgccggatacctgttg 

B.a.- LF 2140 ag t acae,.a cat t..t.,t t..a..a 

TPA-Human LP , 2182 gataagaatcagtctgatctcgtgacaaatagcaaaaaattcatagatattttcaaggag 

B.a.- LF 22O0 c.a t.a..t tct t t a 



TPA-Human LF 2242 gaagggagtaaeetgacttcctatggccgcacgaaegaggctgaattttttgcggaagcc 

B.a.- LP 2260 tt.a g ga.a. .a. .t. .a. .g a 

TPA-Human LF 2302 tttagacttatgcacagcaccgaccatgctgaaaggttgaaggtgcaaaagaatgcccct 

B.a.- LF 2320 gt.a ttct..g c.t..a..a-.t a t..g 

TPA-Human LF 2362 aaaaccttccagttcataaatgaccagatcaagttcatcatcaactcttgaggatcc 

B.a.- LF 2380 t a. . t- .t. , c. . t t t..t a. a, 
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Figure 12 



TPA-Bucoan PA 1 mdeuidcrglec^rlllogavfvspsflagptvpdrdndgipddlevegytvdvki^ 

Sugar minus 1 

TPA-Human PA 61 iaaiheUcgltkyksBpekwfltasdpysdfelcvtgrldJcxtv-epearhplvaaypivhvd^ 

Sugar minus 61 q 

TPA-Hunian PA 121 eniilsknedqstqpitdsetrtiakntstsrthtsevhgnas^diasffdiggsvaagCsn 

Sugar minus 121 q 

TPA-Hunan PA 161 snsstvaidhslslagertvaetmglntadtarlnaniryvntgtapiynvlpttslvlg 

Sugar minus 161 .q . 

TPA-Human PA 241 loiqtlatikakenqlsqilapnnyypeknlapialnaqddCsstpitmnynqflelektk 

Sugar minus 241 .q 

TPA-Human PA 301 qlrldtdqvygniatynfengrvrvdtgsnwsevlpqiqettariifngkdlnlvarria 

Sugar minus 301 q 

TPA-Human PA- 361 avnpsdplettkpdmtlkealkiafgfnepngnlqyrqgkditefdfnf dqqtsqniknql 

Sugar minus 361 ,.q 

TPA^Human PA 421 aelnatniytvldkiklnakmnilirdkrfihydraniavgadeswkeahrevinasteg 

Sugar minus 421 ...q q . 

TPA-Human PA 481 lllnidkdirkilsgylveiedkeglkevindrydmlnisslrqdgktfidfkkyndklp 

Sugar minus 481 q 

TPA-Human PA 541 lyisnpnykvnvyavtkentilnpsengdtstnglkkilifskkgyeig 

Sugar minus 541 q 
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Figure 13 

TPA-Human LP l mdairta^lccvlllcgavfvspsagghgdvginhvkekekxi^^ 

Sugar minus 1 q 

TPA-Human LF €1 eiirikhivkievkgeeavkkeaaekllekvpsdvlerai^i^^ 

Sugar minus 61 

TPA*Human LF 121 aedkkkikdiygkdal'lhehyvyakegyepvlviqsaedyvenbekalnvYyeiglcilBr 
Sugar minus 121 , • 

TPA-Human IiP ISl dilskinqpyqkf Idvlntiknafldsdgqdllftnqlkel^tdfevefleqnflnevqevf 
Sugar minua 181 -q 

TPA-Hiaman LF 241 akaf ayyiepqhrdvlqlyapeafnymdkfneqeinleleelkdqrmlsryekwekiki^ 
Sugar mintXa 241 q ♦ 

TPA^Human LF 301 yq^wsdslseegrgllkklqipiepkkddiihalaqeekellkrlqidsBdflateekef 
Sugar minus 301 - 

TPA-Human LF 361 Ikklqidirdslseeekellnriqvdssnplsekekeflkklkldiqpydinqrlqdtgg 
Sugar minus 361 ^ 

TPA-Eximan LF 421 lidspsinldvrkq^rdiqnidallhqsigstlynkiylyeiumiinnltatlgadlvds 
Sugar minus 421 <2t * • • 

TPA-Human LF 481 bdntkinrgifnefkknfkysissnymivdinerpaldnerlkwriqlapdtragyleng 
Sugar minua 481 

TPA'-Huraan LF 541 klilqrnigleikdvqiikqsekeyiridakvi^pkfikidtkiqeaqlninqevnikalglp 
Sugar minus 541 

TPA-Human LF 601 kytklitfnvhnryasnivesaylilnewknniqsdlikkvtnylvdgngrfvf tditlp . 
Sugar minus 601 

TPA-Buman LF 661 niaeqythqdeiyeqvhskglyvpesrsillhgpskgvelrndsegf iadfgaavddyag 
Sugar minus 661 <I 

TPA-Human LF 721 ylldknqsdlvtnskkf idifkeegsnltsygrtneaef faeaf rlmhstdhaerlkvqk 
Sugar minus 721 q q 

TPA-H\xman LF 781 napktfqfindqikfiins 
Sugar minus 781 • 
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Figure 14 

1 gatatcgcca ccatggatgc aatgaagaga 
m d a m k r 

61 gcagtcttcg tttcgcccag cgccggcggg 
avf vsp aagg 

121 aaggagaaaa acaaggacga aaacaagcgt 
kek nkd enkr 

181 gaacacttaa aggagatcat gaagcacata 
ehl kei nkhl 

241 gtaaagaagg aggcagcaga aaaactgttg 
vkk eaa ekll 

301 tataaggcca tcggcggtaa gatctatatc 
yka igg kiyi 

361 ctcgaagctc tctccgagga caagaaaaag 
lea Ise dkkk 

421 ttgcacgagc actacgttta cgcaaaggag 
Ihe hyv yake 

481 agtgaggact acgtcgagaa taccgagaaa 
sed yve ntek 

541 attctgtccc gggacatcet gtccaaaatc 
lis rdi Iski 

601 cttaacacaa tcaaaaacgc gtcagatagc 
Int ikn asds 

661 ctcaaggaac accccactga tttcagcgtg 
Ike hpt dfsv 

721 caggaggtgt tcgccaaggc atttgcgtac 
qev fak afay 

781 cagctgtacg ccccggaggc atttaactac 
qly ape afny 

841 ctgtctctgg aggaactgaa agaccagtga 
Isl eel kdq- 



gggctctgct gtgtgctgct gctgtgtgga 
glc cvl llcg 

catggggacg ttggcatgca tgtgaaagaa 
hgd vgm hvke 

aaagacgaag aacgtaataa aacacaggag 
kde ern ktqe 

gtaaagattg aggtaaaagg cgaagaggct 
vki evk geea 

gagaaggtgc cttctgacgt cttagagatg 
ekv pad ^rlem 

gtggacggag acatcactaa acacatatct 
vdg dit khia 

attaaagaca tctacgggaa ggatgcctta 
ikd iyg kdal 

ggctatgagc ccgtgctcgt tattcagagt 
gye pvl viqs 

gctctgaatg tgtattacga gatcggaaag 
aln vyy eigk 

aaccagccat accagaaatt ccttgatgtt 
'^^IP fldv 

gacgggcagg atcttctgtt tacaaatcaa 
dgq dl'l ftnq 

gagttcctcg agcagaattc taacgaagtc 
efl eqn snev 

tatatcgaac cccagcatcg cgatgtgctc 
yie pqh rdvl 

atggacaaat tcaatgaaca ggagattaat 
mdk £ne qein 

ggatcc 
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SBQUBNCE LXSTIKO 

<110> Vlcal Incorx)orated 

<120> Codon-Optimized Polynucleotlde-BaBed Vaccines 
Against Bacillus anthracis Infection 

<130> 1530.046PC03 

<150> US 60/409,307 
<151> 2002-Od-lO 

<150> tJS 60/419,089 
<151> 2002-10-18 

<160> 76 

<170> Patentin version 3.1 

<210> 1 

<211> 1788 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic coding region for Human TPA/B. anthracis 
antigen fusion protein 

<220> 

<221> CS>S 

<222> {13) . . (1779) 

<223> 

<400> 1 

gatatcgcca cc atg gat- gca atg aag aga ggg etc tgc tgt ^tg ctg ctg 51 
Met Asp Ala Met Lys Arg Qly Leu Cys Cy^ Val Leu Leu 
15 10 

ctg tgt gga gca gtc ttc gtt teg ccc age age get ggg cca act gtg 99 
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Leu Cye Qly Ala Val Phe Val Ser Pro fier Ser AXa Oly Pro Thr Val 
15 20 25 

ccc gac aga gac aat gat gga ate cct gat agt eta gag gtt gag gga 147 
Pro Asp Arg Asp Abu Asp Gly lie Pro Asp Ser Leu Glu Val Glu Oly 
30 35 40 45 

tac acg gta gat gtc aag aac aaa agg act ttt etc teg cct tgg ate 195 
Tyr Thr Val Asp val Lys Asn Lya Arg thr Phe Leu Ser Pro Trp He 
50 55 €0 

tea aat ate cat gag aag aag ggg ctt ace aag tac aag tec tec ccc 243 
Ser Asn He His Glu Lya Lys Gly Leu Thr Lye Tyr Lya Ser Ser Pro 
65 70 75 

gag aag tgg tct acc get tec gat cca tat age gat ttc gag aag gtc 291 
Glu Lys Trp Ser Thr Ala Ser Asp Pro Tyr Ser Asp Phe Glu Lya Val 
80 85 90 

aca ggc egg ate gat aaa aat gtg tct eca gag get aga cac ccc ctg 339 
Thr Gly Arg He Asp Lys Asn Val Ser Pro Glu Ala Arg His Pro Leu 
95 100 105 

gta gca gee tac ccg att gta cac gtg gac atg gag aac ate att eta 387 
Val Ala Ala Tyr Pro lie Val His Val Asp Met Glu Asn He He Leu 
110 115 120 125 

age aaa aac gag gac cag tec aca caa aac act gac tec gag acc cgc 435 
Ser Lys Asn Glu Asp Gin Ser Thr Gin Asa Thr Asp Ser Glu Thr Arg 
130 135 140 

acc ata tet aaa aac ace agt act tea agg acc cac acc tct gaa gtg 483 
Thr Ho Ser Lye Asn Thr Ser Thr Ser Arg Thr His Thr Ser Glu Val 
145 150 155 

cac ggc aat gcg gaa gtc cat gca teg ttt ttc gat att ggt ggc tec 531 
His Gly Asn Ala Glu Val His Ala Ser Phe Phe Asp He Gly Gly Ser 
160 165 170 

gtg tea gee ggc ttt age aat age aac tec teg acg gtt gee att gac 579 
Val Ser Ala Gly Phe Ser Asn Ser Asn Ser Ser Thr Val Ala He Asp 
175 180 185 

cac tea ctg tea tta gca ggt gag agg act tgg get gaa act atg ggt 627 
His Ser Leu Ser Leu Ala Gly Glu Arg Thr Trp Ala Glu Thr Met Gly 
190 195 200 205 

ctg aat acc gee gat acg gee egg etc aac gca aat att egg tac gtc 675 
Leu Asn Thr Ala Asp Thr Ala Arg Leu Asn Ala Asn He Arg Tyr Val 
210 215 220 

aac aca ggg act get ect ata tat aac gtg ctg cct acg aca agt ctt 723 
Asn Thr Gly Thr Ala Pro He Tyr Asn Val Leu Pro Thr Thr Ser Leu 
225 230 235 

gtc ctg ggc aaa aat cag acc etc gca acc att aag gca aag gaa aat 771 
Val Leu Gly Lys Asn Gin Thr Leu Ala Thr He Lys Ala Lys Glu Asn 
240 245 250 

cag ctg age cag ate etc gee cct aac aac tat tat cca tec aaa aat 819 
Gin Leu Ser Gin He Leu Ala Pro Asn Asn Tyr Tyr Pro Ser Lys Asn 
255 260 265 
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tta gcc ccc ata gcc ctg aac gcc cag gac gac ttt tec tct acc ccc 867 
Leu Ala Pro lie Ala lieu Asn Ala Oln Asp Asp Phe 8er Ser Thr Pro 
270 275 280 265 

ata act atg aat tac aat cag ttc ctg gag ctg gaa aag acg aag cag 915 
lie Thr Met Aen Tyr Asn Oln Phe Leu Qlu Leu Olu LyB Thr Lys Oln 
290 295 300 

Ctg aga eta gac acc gat cag gtg tat gga aac ata gcg aca tat aac 963 
Leu Arg Leu Asp Thr Asp Gin Val Tyr Oly Asn lie Ala Thr Tyr Aan 
305 310 315 

ttt gag aac ggc cgc gtg cgc gtc gac act ggg tea aac tgg tct gaa 1011 
Phe Olu Asn Oly Arg Val Arg Val Asp Thr Gly Ser Asn Trp Ser Olu 
320 325 330 

gtt ctg Gcg caa att caa gag aca acc gcc aga att ate ttt aat ggg 1059 
Val Leu Pro Oln lie Oln Olu Thr Thr Ala Arg lie lie Phe Asn Oly 
335 340 345 

aag gac ttg aac ctt gtc gaa cgt aga att gcc gcc gtg aac ccc agt 1107 
Lys Asp Leu Asn Leu Val Olu Arg Arg lie Ala Ala Val Asn Pro Ser 
350 355 360 365 

gat cca etc gag acg act aaa ccg gat atg aca ctg aaa gag get ctg 1155 
Asp Pro Leu Glu Thr Thr Lys Pro Asp Met Thr Leu Lys Olu Ala Leu 
370 375 380 

Eiag att gee ttc gga ttc aac gaa cet aat ggc aat ttg cag tat cag 1203 
Lys lie Ala Phe Gly Phe Asn Glu Pro Asn Oly Asn Leu Oln Tyr Oln 
385 390 395 

ggg aaa gac ate aca gag ttt gat ttc aat ttc gat cag cag act tec 1251 
Gly Lys Asp lie Thr Olu Phe Asp Phe Asn Phe Asp Gin Oln Thr Ser 
400 405 410 

caa aat ate aaa aat cag ttg gea gag ctg aat gcc acc aat ate tac 1299 
Oln Asn lie Lys Asn Gin Leu Ala Glu Leu Asn Ala Thr Asn lie Tyr 
415 420 425 

acg gtt etc gat aaa ate aaa ctt aac gee stag atg aac ata ttg att 1347 
Thr Val Leu Asp Lys lie Lys Leu Asn Ala Lys Met Asn lie Leu lie 
430 435 440 445 

ega gac aaa cgc ttc cac tac gac cgc aac aat ata gcc gta ggc get 1395 
Arg Asp Lys Arg Phe His Tyr Asp Arg Asn Asn He Ala Val Gly Ala 
450 455 460 

gat gag tct gtc gtc aag gag get cat agg gaa gtt ate aac age agt 1443 
Asp Glu Ser Val Val Lys Glu Ala His Arg Olu Val He Asn Ser Ser 
465 470 475 

act gaa ggg ctg tta ctt aat ate gac aag gac att egg aag ate ctg 1491 
Thr Glu Gly Leu Leu Leu Asn He Asp Lys Asp He Az^ Lys He Leu 
480 485 490 

tec ggg tat ate gtg gag ate gag gat acc gag ggc ctg aag gaa gtc 1539 
Ser Oly Tyr He Val Glu He Glu Asp Thr Glu Gly Leu Lys Olu Val 
495 500 505 

att aac gac cgc tat gat atg ctg aac att tee age tta ega cag gac 1587 
He Asn Asp Arg Tyr Asp Met Leu Asn He ffex Ser Leu Arg Gin Asp 
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510 515 520 525 

ggt aag aca ttt att gac ttt aaa aag tat aac gac aag eta ccc ctg 1635 
61y liys Ttir Phe lie Asp Phe Lys Lye Tyr Asn Asp Lys Leu Pro Leu 
530 535 540 

tac att tec aac oca aat tac aaa gtt aat gtg tat get gta acc aag 1683 
Tyr lie Ser Asn Pro Asn Tyr Lya Val Aen val Tyr Ala Val Thr Lys 
545 550 555 

gag aac aca ate ate aat cca age gag aae ggc gat ace age aca aat 1731 
Olu Asn Thr lie lie Asn Pro Ser Qlu Asn Gly Asp Thr Ser Thr Asn 
560 565 570 

gga ate aaa aag ate ett ata ttt agt aaa aaa gge tae gag ate ggt 1779 
Gly lie Lys Lys lie I*eu lie Phe Ser Lys Lya Gly Tyr Glu He Gly 
575 5B0 585 

tgaggatcc 1788 

<210> 2 

<211> 589 

<212> PRT 

<213> Artificial sequence 



<220> 

<223> Human TPA/ B. anthracia antigen fusion protein 
<400> 2 

Met Asp Ala Hat Lye Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly 
15 10 15 



Ala Val Phe Val Ser Pro Ser Ser Ala Gly Pro Thr Val Pro Asp Arg 
20 25 30 



Asp Asn Asp Gly He Pro Asp Ser Leu Glu Val Olu Gly Tyr Thr Val 
35 40 45 



Asp Val Lys Asn Lys Arg Thr Phe Leu Ser Pro Trp He Ser Asn He 
50 55 60 



His Glu Lya Lys Gly Leu Thr Lys Tyr Lys Ser Ser Pro Olu Lys Trp 
65 70 75 80 



Ser Thr Ala Ser Asp Pro Tyr Ser Asp Phe Olu Lys Val Thr Gly Arg 
85 90 95 



He Asp Lys Asn Val Ser Pro Glu Ala Arg His Pro Leu Val Ala Ala 
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-5- 

100 105 110 



Tyr Pro lie Val His Val Asp Ket Olu Asn lie lie Leu Ser LyB Asn 
115 120 125 



6lu Asp Gin Ser Thr Gin Asn Thr Asp Ser Glu Thr Arg Thr lie Ser 
130 135 140 



Lys Asn Thr Ser Thr Ser Arg Thr His Thr Ser Glu Val His Oly Asn 
145 150 155 160 



Ala Glu Val His Ala Ser Fhe Phe Asp He Gly Gly Ser Val Ser Ala 
165 170 175 



Gly Phe Ser Asn Ser Asn Ser Ser Thr Val Ala lie Asp His Ser lieu 
180 185 190 



Ser Leu Ala Gly Glu Arg Thr Trp Ala Olu Thr Met Gly Leu Asn Thr 
195 200 205 



Ala Asp Thr Ala Arg Leu Asn Ala Asn He Arg Tyr Val Asn Thr Gly 
210 215 220 



Thr Ala Pro He Tyr Asn Val Leu Pro Thr Thr Ser Leu Val Leu Gly 
225 230 235 240 



Lys Asn Gin Tlir Leu Ala Thr He Lys Ala Lys Glu Asn Gin Leu Ser 
245 250 255 



Gin ile Leu Ala Pro Asn Asn Tyr Tyr Pro Ser Lys Asn Leu Ala Pro 
260 265 270 



He Ala Leu Asn Ala Gin Asp Asp Phe Ser Ser Thr Pro He Thr Met 
275 2B0 285 



Asn Tyr Asn Gin Phe Leu Glu Leu Glu Lys Thr Lys Gin Leu Arg Leu 
290 295 300 



Asp Thr Asp Gin Val Tyr Gly Asn He Ala Thr Tyr Asn Phe Glu Asn 
305 310 315 320 



Gly Arg Val Arg Val Asp Thr Gly Ser Asn Trp Ser Glu Val Leu Pro 
325 330 335 



Gin He Gin Glu Thr Thr Ala Arg Ile lie Phe Asn Gly Lys Asp Leu 
340 345 350 
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Asn Leu Val Glu Keg Arg lie Ala Ala Val Asn Pro Ser Asp Pro lieu 
355 360 365 



Olu Thr Thr Lys Pro Asp Met Tbr Leu Lys Glu Ala Leu Lys lie Ala 
370 375 300 



Pile Qly Phe Asn Olu Pro Asn Gly Asn Leu Gin Tyr Qln Gly Lys Asp 
385 390 395 400 



lie Thr Glu Phe Aap Phe Asn Phe Asp Gin Gin Thr Ser Gin Asn He 
405 410 415 



Lys Asn Gin Leu Ala Glu Leu Asn Ala Thr Asn He Tyr Thr Val Leu 
420 425 430 



Asp Lys lie Lys Leu Asn Ala Lys Met Asn lie Leu He Arg Asp Lys 
435 440 445 



Arg Phe His Tyr Asp Arg Asn Asn He Ala Val Gly Ala Asp Olu Ser 
450 455 460 



Val Val Lys Glu Ala His Arg Glu Val He Asn Ser Ser Thr Glu Gly 
465 470 475 480 



Leu Leu Leu Asn lie Asp Lys Asp He Arg Lys He Leu Ser Gly Tyr 
485 490 495 



He Val Glu He Glu Asp Thr Glu Qly Leu Lys Glu Val He Asn Asp 
500 505 510 



Arg Tyr Asp Met Leu Asn He Ser Ser Leu Arg Qln Asp Gly Lys Thr 
515 520 525 



Phe He Asp Phe Lys Lys Tyr Asn Asp Lys Leu Pro Leu Tyr He Ser 
530 535 540 



Asn Pro Asn ^r Lys Val Asn Val Tyr Ala Val Thr Lys Glu Asn Thr 
545 550 555 560 



He He Asn Pro Ser Glu Asn Gly Asp Thr Ser Thr Asn Gly He Lys 
565 570 575 



Lys He Leu He Phe Ser Lys Lys Gly Tyr Glu He Gly 
580 585 



<210> 3 
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-7- 

<211> 4235 

<212> DNA 

<213> Bacillus anthracls 
<220> 

<221> CDS 

<222> (X604) . . (4098) 
<223> 



<400> 3 
BBQQ C u C C^u 


ca^tcgrbaaa 


uC uc aaacag 


aacgvaaauw ttagacu^cbc 


a wG a b uaaaa 


fin 




craccbuccr 


gacuciiauug 


cauacCbCka utaaggugtu 


^ n 9 V 9 <*T^ ^^rt 

uaauagu wcig 


lift 


aaaagacagt 


tgatgccatt 


acUccagatia 


aaatatagct aaccabaaat 


ucac waaaga 




aaccttgttg 


ttctaaataa 


tgactttgcg 


gattccggaa tagatactgg 


tgagi^tagcb 




ctaattttat 


agtgatttaa 


ctaacaattt 


ataaagcagc ataattcaaa 


ct^ctiutaact: 


300 


gatttttcct 


gaagcatagt 


ataaaagagt 


caaggtcttc tagacttgac 


^cttggaatc 


^ <r A 


attaggaatt 


aacaatatat 


ataatgcgct 


agacagaatc aaattaaatg 


caaaaa tgaa 




tattttagta 


agagatccat 


atcattatga 


taataacggt aatattgtag 


gggttgatga 


480 


ttcatattta 


aaaaacgcat 


ataagcaaat 


acttaattgg tcaagcgatg 


gagtttcttt 


540 


aaatctagat 


gaagatgtaa 


atcaagcact 


atctggatat atgcttcaaa 


taaaaaeiacc 


600 


ttcaaaccac 


ctaacaeiaca 


gcccagttac 


aattacatta gcaggcaagg 


acagtggtgt 


660 


tggagaattg 


tatagagtat 


tatcagatgg 


agcaggattc ctggatttca 


ataagtttga 


720 


tgaaaattgg 


cgatcattag 


tagatcctgg 


tgatgatgtt tatgtgtatg 


ctgttactaa 


780 


agaagatttt 


aatgcagtta 


ctcgagatga 


aaatggtaat atagcgaata 


aattaaaaaa 


640 


caccttagtt 


ttatcgggta 


aaataaetaga 


aataaacata aaaactacaa 


atattaatat 


900 


atttgtagtt 


tttatgttta 


ttatatacct 


cctattttat attabtagta 


gcacagtttt 


960 


tgcaaatcat 


gtaattgtat 


acttatctat 


gtagaggtat cacaacttat 


gaatagtgta 


1020 


ttttattgaa 


cgttggttag 


cttggacagt 


tgtatggata tgcatacttt 


ataacgtata 


1080 


aaatttcacg 


caccacaata 


aaactaattt 


aacaaaaaca aaaacacacc 


taagatcatt 


1140 


cagttctttt 


aataaggagc 


tgcccaccEia 


gctaaaccta aataatcttt 


gtttcacata 


1200 


aggttttttt 


ctaaatatac 


agtgtaagtt 


attgtgaatt taaccagtat 


atattaaaaa 


1260 


tgttttatgt 


taacaaatta 


aattgtaaaa 


cccctcttaa gcatagttaa 


gaggggtagg 


1320 
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ttt^aaattt 


tttcrttaaaa 


1 1 agaa aci&a 


taataaaaaa 


acaaiacctat tttctttcag 


1380 


gttgtttttg 


ggttacaaaa 


caaaaagaaa 


acatgtttca 


aggtacaata attatggttc 


1440 


tttagctttc 


tgtaaaacag 


ccttaatagt 


tggatttatg 


actattaaag ttagtataca 


1500 


gcatacacaa 


tctattgaag 


gatatttata 


atgcaattcc 


ctaaaaatag ttttgtataa 


1560 


ccagttcttt 


tatccgaact 


gatacacgta 


ttttagcata 


atttttaatg tatcttcetaa 


1620 


aacagcttct 


gtgtcctttt 


Gtattaaaca 


tataaattct 


tttttatgtt atatatttat 


1680 


aaeiagttctg 


tttaaaaagc 


caaaaataaa 


taattatctc 


tttttattta tattatattg 


1740 


aaactaaagt 


ttattaattt 


caatataata 


taaatttaat 


tttatacaaa aaggagaacg 


IBOO 


tat atg aaa aaa cga aaa gtg tta ata cca tta 
Met Iiya Lya Arg Iiys Val Leu lie Pro Leu 


atg gca ttg tct acg 
Met Ala Leu Ser Thr 


1848 



15 10 15 

ata tta gtt tea age aca ggt aat tta gag gtg att cag gca gaa gtt 1696 
lie Leu Val Ser Ser Thr Oly Aen Leu Qlu Val lie Qln Ala Qlu Val 
20 25 30 

aaa cag gag aac egg tta tta aat gaa tea gaa tea agt tec cag ggg 1944 
Lya 61n Glu Aan Arg Leu Leu Aen Qlu Ser Glu Ser Ser Ser Qln Qly 
35 40 45 

tta eta gga tac tat ttt agt gat ttg aat ttt caa gca ccc atg gtg 1992 
Leu Leu Qly Tyr Tyr Pbe Ser Asp Leu Aan Phe Qln Ala Fro Met Val 
50 55 60 

gtt acc tct tct act aca ggg gat tta tct att cct agt tct gag tta 2040 
Val Thr Ser Ser Thr Thr Qly Asp Leu Ser He Pro Ser Ser Qlu Leu 
65 70 75 

gaa aat att cca teg gaa aac caa tat ttt eaa tct get att tgg tea 2006 
Olu Asn He Pro Ser Qlu Aan Gin Tyr Phe Gin Ser Ala He Trp Ser 
80 65 90 95 

gga ttt ate aaa gtt aag ug agt gat gaa tat aca ttt get act tec 2136 
Oly Phe He Lys Val Lys Lys Ser Asp Qlu Tyr Thr Phe Ala Thr Ser 
100 105 110 

get gat aat cat gta aca atg tgg gta gat gac caa gaa gtg att aat 2184 
Ala Asp Asn Hia Val Thr Met Trp Val Asp Asp Qln Qlu Val He Aan 
115 120 125 

aaa get tct aat tct aac aaa ate aga tta gaa aaa gga aga tta tat 2232 
Lys Ala Ser Asn Ser Asn Lys He Arg Leu Olu Lys Qly Ax^ Leu Tyr 
130 135 140 

caa ata aaa att eaa tat eaa cga gaa aat cct act gaa aaa gga ttg 2280 
QXn He Lys He Qln Tyr Gin Arg Glu Asn Pro Thr Qlu Lys Qly Leu 
145 150 155 

gat ttc aag ttg tac tgg acc gat tct caa aat aaa aaa gaa gtg att 2328 
Asp Phe Lys Leu Tyr Trp Thr Asp Ser Qln Aan Lys Lys Qlu Val He 
160 165 170 175 

tct agt gat aac tta caa ttg cca gaa tta aaa caa aaa tct teg aac 2376 
Ser Ser Asp Asn Leu Qln Leu Pro Qlu Leu Lys Qln Lys Ser ser Asn 
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180 185 190 

tea aga aaa aag cga agt aca agt get gga cct acg gtt cca gac cgt 2424 
Ser Arg Lys Lys Arg Ser Thr Ser Ala Qly Pro Tlur Val Pro Aap Arg 
195 200 205 

gac aat gat gga ate cet gat tea tta gag gta gaa gga tat acg gtt 2472 
Aap Abu Asp Oly lie Pro Asp Ser Leu Glu Val Olu Oly Tyr Thr Val 
210 215 220 

gat gtc aaa aat aaa aga act ttt ctt tea cca tgg att tct aat att 2520 
Asp Val Lys Asn Lys Arg Thr Phe Leu Ser Pro Trp lie Ser Asn lie 
225 230 235 

cat gaa aag aaa gga tta ace aaa tat aaa tea tct cct gaa aaa tgg 2568 
His Glu Lys Lys Qly Leu Thr Lys Tyr Lye Ser Ser Pro Glu Lys Trp 
240 245 250 255 

age acg get tct gat ccg tac agt gat tte gaa aag gtt aca gga egg 2616 
Ser Thr Ala Ser Asp Pro Tyr Ser Asp Phe Glu Lys Val Thr Gly Arg 
260 265 270 

att gat aag aat gta tea cca gag gca aga cac ccc ctt gtg gca get 2664 
He Asp Lys Asn Val Ser Pro Glu Ala Arg His Pro Leu Val Ala Ala 
275 280 285 

tat ccg att gta cat gta gat atg gag aat att att etc tea aaa aat 2712 
Tyr Pro He val His Val Asp Met Glu Asn He He Leu Ser Lys Asn 
290 295 300 

gag gat caa tec aca cag aat act gat agt gaa acg aga aca ata agt 2760 
Glu Asp Gin Ser Thr Gin Asn Thr Asp Ser Glu Thr Arg Thr He Ser 
305 310 315 

aaa aat act tct aca agt agg aca cat act agt gaa gta cat gga aat 2 BOS 
Lys Asn Tlir Ser Thr Ser Arg Thr His Thr Ser Glu Val His Gly Asn 
320 325 330 335 

gca gaa gtg eat gcg teg ttc ttt gat att ggt ggg agt gta tct gca 2856 
Ala Glu Val His Ala Ser Phe Phe Asp He Gly Gly Ser Val Ser Ala 
340 345 350 

gga ttt agt aat teg aat tea agt acg gtc gca att gat cat tea eta 2904 
Gly Phe Ser Asn Ser Asn Ser Ser Thr Val Ala He Asp His Ser Leu 
355 360 365 

tct eta gca ggg gaa aga act tgg get gaa aca atg ggt tta aat acc 2952 
Ser Leu Ala Gly Glu Arg Thr Trp Ala Glu Thr Met Gly Leu Asn Thr 
370 375 380 

get gat aca gca aga tta aat gcc aat att aga tat gta aat act ggg 3000 
Ala Asp Thr Ala Arg Leu Asn Ala Asn He Arg Tyr Val Asn Thr Gly 
385 390 395 

acg get cca ate tac aac gtg tta cca acg act teg tta gtg tta gga 3048 
Thr Ala Pro He Tyr Asn Val Leu Pro Thr Thr Ser Leu Val Leu Gly 
400 405 410 415 

aaa aat caa aca etc gcg aca att aaa get aag gaa aac caa tta agt 3096 
Lys Asn Gin Thr Leu Ala Thr He Lys Ala Lys Glu Asn Gin Leu Ser 
420 425 430 
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caa ata ctt gca cct aat aat tat tat cct tct aaa aac ttg gcg cca 3144 
Oln lie Leu Ala Pro Asn Aan Tyr Tyx Pro Ser Lya Asn Leu Ala Pro 
435 440 445 

ate gca tta aat gca caa gac gat ttc agt tct act cca att aca atg 3192 
He Ala Leu Asn Ala Oln Aap Asp Phe Ser Ser Thr Pro lie Thr Met 
450 455 460 

aat tac aat caia ttt ctt gag tta geia aaa acg aaa caa tta aga tta 3240 
Asn Tyr Asn Oln Phe Leu Olu Leu Olu Lys Thr Lya Oln Leu Arg Leu 
465 470 475 

gat acg gat caa gta tat ggg aat ata gca aca tac aat ttt gaa aat 3288 
Aap Thr Asp Oln Val Tyr Oly Asn He Ala Thr Tyr Asn Phe Olu Asn 
480 485 490 495 

gga aga gtg agg gtg gat aca ggc teg aac tgg agt gaa gtg tta ccg 3336 
Gly Arg Val Arg Val Asp Thr Oly Ser Asn Trp Ser Olu Val Leu Pro 
500 505 510 

caa att caa gaa aca act gca cgt ate att ttt aat gga aaa gat tta 3384 
Oln He Oln Olu Thr Thr Ala Arg He He Phe Asn Oly Lys Asp Leu 
515 520 525 

aat ctg gta gaa agg egg ata gcg gcg gtt aat cct agt gat cca tta 3432 
Asn Leu Val Olu Arg Arg He Ala Ala Val Asn Pro Ser Asp Pro Leu 
530 535 540 

gaa acg act aaa ccg gat atg aca tta aaa gaa gee ctt aaa ata gca 3480 
Olu Thr Thr Lys Pro Asp Met Thr Leu Lys Olu Ala Leu Lys He Ala 
545 550 555 

ttt gga ttt aac gaa ccg aat gga aac tta caa tat caa ggg aaa gac 3528 
Phe Oly Phe Asn Olu Pro Asn Oly Asn Leu Gin Tyr Oln Oly Lys Asp 
560 565 570 575 

ata acc gaa ttt gat ttt aat ttc gat caa caa aca tct caa aat ate 3576 
He Thr Olu Phe Asp Phe Asn Phe Asp Qln Oln Thr Ser Oln Asn He 
580 585 590 

aag aat cag tta gcg gaa tta aac gca act aac ata tat act gta tta 3624 
Lys Asn Gin Leu Ala Olu Leu Asn Ala Thr Asn He Tyr Thr Val Leu 
595 600 605 

gat aaa ate aaa tta aat gca aaa atg siat att tta ata aga gat aeia 3672 
Asp Lys He Lys Leu Asn Ala Lys Met Asn He Leu He Arg Asp Lys 
610 615 620 

cgt ttt cat tat gat aga aat aac ata gca gtt ggg gcg gat gag tea 3720 
Arg Phe His Tyr Asp Arg Asn Asn He Ala Val Oly Ala Asp Olu Ser 
625 630 635 

gta gtt aag gag get cat aga gaa gta att aat teg tea aca gag gga 3768 
Val Val Lys Olu Ala His Arg Olu Val He Asn Ser Ser Thr Glu Oly 
640 645 650 655 

tta ttg tta aat att gat aag gat ata aga aaa ata tta tea ggt tat 3816 
Leu Leu Leu Asn He Asp Lys Asp He Arg Lys He Leu Ser Gly Tyr 
660 665 670 

att gta gaa att gaa gat act gaa ggg ctt aaa gaa gtt ata aat gac 3864 
He Val Glu He Olu Asp Thr Olu Oly Leu Lys Glu Val He Asn Asp 
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675 680 685 

aga tat gat atg ttg aat att tct agt tta egg caa gat gga aaa aca 3912 
Arg Tyr Asp Met Leu Asn lie Ser Ser Leu Arg Oln Asp Qly hys Thx 
690 695 700 

ttt ata gat ttt aaa aaa tat aat gat aaa tta ccg tta tat ata agt 3960 
Phe lie Asp Phe Lya Lye Tyr Asn Asp Lys Leu Pro Leu Tyr lie Ser 
705 710 715 

giat ccc aat tat aag gta aat gta tat get gtt act aaa gaa aac act 4008 
Asn Pro Aan Tyr Lys Val Asn Val Tyr Ala Val Thr Lys Glu Asn Thr 
720 725 730 735 

att att aat cct agt gag aat ggg gat act agt acc aac ggg ate aag 4056 
lie lie Asn Pro Ser Glu Asn Oly Asp Thr Ser Thr Asn Gly lie Lys 
740 745 750 

aaa att tta ate ttt tct aaa aaa ggc tat gag ata gga taa 4098 
Lys lie Leu lie Phe Ser Lya Lya Gly Tyr Glu lie Oly 
755 760 

ggtaattcta ggtgattttt aaattatcta aaaaaeagta aaattaaaac atactctttt 4158 

tgtaeigaaat acaaggagag tatgttttaa acagtaatct aaatcatcat aatectttga 4218 

gattgtttgt aggatce 4235 

<210> 4 

<211> 764 

<212> PRT 

<23,3> Bacillus anthracis 



<400> 4 

Met Lys Lys Arg Lys Val Leu lie Pro Leu Met Ala Leu Ser thr lie 
15 10 15 



Leu Val Ser Ser Thr Gly Asn Leu Glu Val lie Gin Ala Glu Val Lys 
20 25 30 



Gin Glu Asn Arg Leu Leu Asn Glu Ser Glu Ser Ser Ser Gin Gly Leu 
35 40 45 



Leu Gly Tyr Tyr Phe Ser Asp Leu Asn Phe Gin Ala Pro Met Val Val 
50 55 60 



Thr Ser Ser Thr Thr Gly Asp Leu Ser He Pro Ser Ser Glu Leu Glu 
65 70 75 80 



Asn lie Pro Ser Glu Asn Gin Tyr Phe Gin Ser Ala lie Trp Ser Gly 
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90 



95 



Phe lie Lys Val Lys Itye Sex Asp Glu Tyr Tbr Phe Ala Thr Ser Ala 
100 105 110 



Asp Ash His Val Thr Met Trp Val Asp Asp Gin Glu Val lie Asn Itys 
115 120 125 



Ala Ser Asn Ser Asn Lys lie Arg Leu Glu Lys Gly Arg Leu Tyr Gin 
130 135 140 



lie Lys lie Gin Tyr Gin Arg Glu Asn Pro Thr Glu Lys Gly Leu Asp 
145 150 155 160 



Phe Lys Leu Tyr Trp Thr Asp Ser Gin Asn Lys Lys Glu Val lie Ser 
165 170 175 



Ser Asp Asn Leu Gin Leu Pro Glu Leu Lys Gin Lys Ser Ser Asn Ser 
180 185 190 



Arg Lys Lys Arg Ser Thr Ser Ala Gly Pro Thr Val Pro Asp Arg Asp 
195 200 205 



Asn Asp Gly He Pro Asp Ser Leu Glu Val Glu Gly Tyr Thr Val Asp 
210 215 220 



val Lys Asn Lys Arg Thr Phe Leu Ser Pro Trp He Ser Asn He His 
225 230 235 240 



Glu Lys Lys Gly Leu Thr Lys Tyr Lys Ser Ser Pro Glu Lys Trp Ser 
245 250 255 



Thr Ala Ser Asp Pro Tyr Ser Asp Phe Glu Lys Val Thr Gly Arg He 
260 t 265 270 



Asp Lys Asn Val Ser Pro Glu Ala Arg His Pro Leu Val Ala Ala Tyr 
275 280 285 



Pro He Val His Val Asp Met Glu Asn He He Leu Ser Lys Asn Glu 
290 295 300 



Asp Gin Ser Thr Gin Asn Thr Asp Ser Glu Thr Arg Thr He Ser Lys 
305 310 315 320 



Asn Thr Ser Thr Ser Arg Thr His Thr Ser Glu Val His Gly Asn Ala 
325 330 335 
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Glu Val Hl6 Ala 6er Fhe Plie Asp lie Gly oly ser Val Ser Ala Gly 
340 345 350 



Phe Ser Asn Ser Asn Ser Ser Thr Val Ala lie Asp His Ser Leu Ser 
355 360 365 



Leu Ala Gly Glu Arg Thr Tip Ala Glu Thr Met Gly Leu Aan Thr Ala 
370 375 380 



Asp Thr Ala Arg Leu Asn Ala Asn lie Arg Tyr Val Asn Thr Gly Thr 
385 390 395 400 



Ala Pro lie Tyr Aan Val Lau Pro Thr Tiir Ser Leu Val Leu Gly Lys 
405 410 415 



Asn Gin Thr Leu Ala Thr Xle Lys Ala Lys Glu Asn Gin Leu Ser Gin 
420 425 430 



lie Leu Ala Pro Asn Asn Tyr Tyr Pro Ser Lys Asn Leu Ala Pro lie 
435 440 445 



Ala Leu Asn Ala Gin Asp Asp Phe Ser Ser Thr Pro lie Thr Met Asn 
450 455 460 



Tyr Asn Gin Phe Leu Glu Leu Glu Lys Thr Lys Gin Leu Arg Leu Asp 
465 470 475 480 



Thr Asp Gin Val Tyr Gly Asn lie Ala Thr Tyr Asn Phe Glu Asn Gly 
485 490 495 



Arg Val Arg Val Asp Thr Gly Ser Asn Txp Ser Glu Val Leu Pro Qln 
500 505 510 



He Gin Glu Thr Thr Ala Arg He He Phe Asn Gly Lys Asp Leu Asn 
5X5 520 525 



Leu Val Glu Arg Arg He Ala Ala Val Asn Pro Ser Asp Pro Leu Glu 
530 535 540 



Thr Thr Lya Pro Asp Met Thr Leu Lys Glu Ala Leu Lys He Ala Phe 
545 550 555 560 



Gly Phe Asn Glu Pro Asn Gly Asn Leu Gin Tyr Gin Gly Lys Asp He 
565 570 575 



Thr Glu Phe Asp Phe Asn Phe Asp Gin Gin Thr Ser Gin Asn He Lys 
580 585 590 
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Aen Gin Leu Ala Glu Leu Asn Ala Tlxr Asn lie Tyr Thr Val Leu ABp 
595 600 605 



Lys lie Lys Leu Asn Ala Lys Met Asn lie Leu He Arg Aep Lys Arg 
610 615 620 



Phe His Tyr Asp Arg Aan Aan He Ala Val Gly Ala Aap Glu Ser Val 
625 630 635 640 



Val Lye Glu Ala His Arg Glu Val lie Asn Ser Ser Thr Glu Gly Leu 

645 650 655 



Leu Leu Asn He Asp Lys Asp lie Arg Lys He Leu Ser Qly Tyr lie 
660 665 670 



Val Glu He Glu Asp Thr Glu Gly Leu Lys Glu Val He Asn Asp Arg 
675 680 685 



Tyr Asp Met Leu Asn He Ser Ser Leu Arg Gin Asp Gly Lys Thr Phe 
690 695 700 



He Asp Phe Lye Lys Tyr Asn Asp Lys lieu Fro Leu Tyr He Ser Asn 
705 710 715 720 



Pro Asn Tyr Lys Val Asn Val Tyr Ala Val Thr Lys Glu Aen Thr He 
725 730 735 



He Asn Pro Ser Glu Asn Gly Asp Thr Ser Thr Asn Gly He Lys Lys 
740 745 750 



He Leu He Phe Ser Lys Lys Gly Tyr Glu He Gly 
755 760 



<210> 5 

<211> 1782 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic coding region for Human TPA/ synthetic 
antigen fusion protein 

<220> 
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<221> CDS 

<222> (13) .. (1773) 

<223> 



<400> 5 

gatatcgcca cc atg gat gca atg aag aga ggg etc tgc tgt gtg ctg ctg 51 
Met Asp Ala Met Lys Arg Gly Leu Cys Cya Val Leu Leu 
15 10 

ctg tgt gga gca gtc ttc gtt teg ccc ago age get ggg oca act gtg 99 
Leu Cya Gly Ala Val Phe Val Ser Pro Ser Ser Ala Gly Pro Thr Val 
15 20 25 

ccc gac aga gac aat gat gga ate cot gat agt eta gag gtt gag gga 147 
Pro Asp Arg Asp Aan Asp Gly lie Pro Asp Ser Leu Giu Val Glu Gly 
30 35 40 45 

tac acg gta gat gtc aag aac aaa agg act ttt etc teg cct tgg ate 195 
Tyr Thr Val Asp Val Lyfl Asn Lys Arg Thr Pho Leu Ser Pro Trp lie 
50 55 60 

tea aat ate cat gag aag aag ggg ctt acc aag tac aag tec tec ccc 243 
Seac Asn He His Olu Lys Lye Gly Leu Thr Lys Tyr Lya Ser Ser Pro 
65 70 75 

gag aag tgg tot acc get tec gat cca tat age gat ttc gag aag gtc 291 
Glu Lys Trp Ser Thr Ala Ser Asp Pro Tyr Ser Asp Phe Glu Lys Val 
80 85 dO 

aca ggc egg ate gat aaa aat gtg tct cca gag get aga cac ccc ctg 339 
Thr Gly Arg He Asp Lya Asn Val Ser Pro Glu Ala Arg His Pro Leu 
95 100 105 

gta gca gee tac ccg att gta cac gtg gac atg gag aac ate att eta 387 
Val Ala Ala Tyr Pro lie Val His Val Asp Met Glu Asn He He Leu 
110 115 120 125 

age aaa aac gag gac cag tec aca caa aac act gac tec gag acc cgc 435 
Ser Lys Asn Glu Asp Gin Ser Thr Gin Asn Thr Asp Ser Glu Thr Arg 
130 135 140 

acc ata tet aaa aac acc agt act tea agg acc cac ace tet gaa gtg 483 
Thr He Ser Lya Asn Thr Ser Thr Ser Arg Thr His Thr Ser Glu Val 
145 150 155 

cac ggc aat gcg gaa gtc cat gca teg gat att ggt ggc tec gtg tea 531 
His Gly Asn Ala Glu Val His Ala Ser Asp He Gly Gly Ser Val Ser 
160 165 170 

gee ggc ttt age aat age aac tec teg acg gtt gee att gac cac tea 579 
Ala Gly Phe Ser Asn Ser Asn Ser Ser Thr Val Ala He Asp His Ser 
175 180 165 

ctg tea tta gca ggt gag agg act tgg get gaa act atg ggt ctg aat 627 
Leu Ser Leu Ala Gly Glu Arg Thr Trp Ala Glu Thr Met Gly Leu Asn 
190 195 200 205 

acc gee gat acg gee egg etc aac gca aat att egg tac gtc aac aca 675 
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Tfar Ala Asp Thr Ala Arg Leu Asn Ala Asn lie Arg Tyr Val Asn Thr 
210 215 220 

999 act get cct ata tat aac gtg ctg cct acg aca agt ctt gtc ctg 723 
Oly Thr Ala Pro lie Tyr Aan Val Leu Pro Thr Thr Ser hen val Leu 
225 230 235 

ggc aaa aat cag acc etc gca acc att aag gca aag gaa aat cag ctg 771 
Gly LyB Asn Qlzi Thr Leu Ala Thr lie Lys Ala Lys Glu Asn Gin Leu 
240 245 250 

age cag ate etc gcc cct aa.c aac tat tat cca tec aaa aat tta gcc 819 
Ser Qln He Leu Ala Pro Asn Aan Tyr Tyr Pro 6er Lys Asn Leu Ala 
255 260 265 

ccc ata gcc ctg aac gcc cag gac gac ttt tec tct acc ccc ata act B67 
Pro He Ala Leu Aan Ala Gin Asp Asp Phe Ser Ser Thr Pro He Thr 
270 275 280 285 

atg aat tac aat cag ttc ctg gag ctg gaa aag acg aag cag ctg aga 915 
Met Asn Tyr Aan Gin Phe *3Lieu Qlu Leu Glu Lys Thr Lya Gin Leu Arg 
290 295 300 

eta gac acc gat cag gtg tat gga aac ata gcg aca tat aac ttt gag 963 
Leu Asp Thr Asp Gin Val Tyr Gly Asn lie Ala Thr Tyr Aan Phe Glu 
305 310 315 

aac ggc cgc gtg cgc gtc gac act ggg tea aac tgg tct gaa gtt ctg 1011 
Asn Gly Arg Val Arg Val Asp Thr Gly Ser Asn Trp Ser Glu Val Leu 
320 325 330 

ccg caa att caa gag aca acc gcc aga att ate ttt aat ggg aag gac 1059 
Pro Gin He Qln Qlu Thr Thr Ala Arg He He Phe Aan Gly Lys Asp 
335 340 345 

ttg aac ctt gtc gaa cgt aga att gcc gcc gtg aac ccc agt gat cca 1107 
Leu Asn Leu Val Glu Arg Arg He Ala Ala Val Aan Pro Ser Asp Pro 
350 355 360 365 

etc gag acg act aaa cdg gat atg aca ctg aaa gag get ctg aag att 1155 
Leu Glu Thr Thr Lys Pro Asp Met Thr Leu Lys Glu Ala Leu Lys He 
370 375 380 

gcc ttc gga ttc aac gaa cct aat ggc aat ttg cag tat cag ggg aaa 1203 
Ala Phe Oly Phe Asn Glu Pro Aan Gly Asn Leu Gin Tyr Gin Gly Lys 
385 390 395 

gac ate aca gag ttt gat ttc aat ttc gat cag cag act tec caa aat 1251 
Asp He Thr Glu Phe Asp Phe Asn Phe Asp Gin Gin Thr Ser Gin Asn 
400 405 410 

ate aaa aat cag ttg gca gag ctg aat gcc acc aat ate tac acg gtt 1299 
He Lys Asn Gin Leu Ala Glu Leu Asn Ala Thr Asn He Tyr Thr Val 
415 420 425 

etc gat aaa ate aaa ctt aac gcc aag atg aac ata ttg att cga gac 1347 
Leu Asp Lys He Lys Leu Asn Ala Lys Met Asn He Leu He Arg Asp 
430 435 440 445 

aaa cgc ttc cac tac gac cgc aac aat ata gcc gta ggc get gat gag 1395 
Lys Arg Phe His Tyr Asp Arg Asn Asn He Ala Val Gly Ala Asp Glu 
450 455 460 
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tct gtc gtc aag gag get cat agg gaa gtt ate aac age agt act gaa 1443 
Ser Val Val Lys Glu Ala Hia Arg Olu Val lie Asn Ser Ser Thr Glu 
465 470 475 

ggg ctg tta ctt aat ate gac aag gac att egg aag ate etg tec ggg 1491 
Oly Leu Leu Leu Asn He Asp Lys Asp He Arg Lys He Leu Ser Oly 
480 485 490 

tat ate gtg gag ate gag gat ace gag gge ctg aag gaa gtc att aac 1539 
Tyr He Val Glu He Glu Asp Thr Glu Gly Leu Lys Glu Val He Asn 
495 500 505 

gac cgc tat gat atg ctg aac att tec age tta cga cag gac ggt aag 1587 
Asp Arg Tyr Asp Met I^eu Asn He 8er Ser Leu Arg Gin Asp Oly Lys 
510 515 520 525 

aca ttt att gac ttt aaa aag tat aac gac aag eta ccc ctg tac att 1635 
Thr Phe He Asp Phe Lys Lys Tyr Asn Asp Lys Leu Pre Leu Tyr He 
530 535 540 

tec aac cca aat tac aaa gtt aat gtg tat get gta acc aag gag aac 1683 
Ser Asn Pro Asn Tyr Lys Val Asn Val Tyr Ala Val Thr Lys Glu Asn 
545 550 555 

aca ate ate aat cca age gag aac ggc gat acc age aca aat gga ate 1731 
Thr He He Asn Pro Ser Glu Asn Gly Asp Thr Ser Thr Asn Gly He 
560 565 570 

aaa aag ate ctt ata ttt agt aaa aaa ggc tac gag ate ggt tgaggatcc 1782 
Lys Lys He Leu He Phe Ser Lys Lys Gly Tyr Glu He Gly 
575 580 585 

<210> 6 

<211> 587 

<212> PRT 

<213> Artificial 6eq[uence 



<a20> 

<223> Human TPA/synthetic antigen fusion protein 
<400> 6 

Met Asp Ala Met Lys Arg Gly Leu Cye Cys Val Leu Leu Leu Cys Gly 
1 5 10 15 



Ala Val Phe val Ser Pro Ser Ser Ala Gly Pro Thr Val Pro Asp Arg 
20 25 30 



Asp Asn Asp Gly He Pro Asp Ser Leu Glu Val Glu Gly Tyr Thr Val 
35 40 45 



Asp Val Lys Asn Lys Arg Thr Phe Leu Ser Pro Tip He Ser Asn Ha 
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50 



55 



60 



hIb Glu Lye Lye Gly lieu TJir Lys Tyr Lye Ser Ser Pro Glu Lye Trp 
65 70 75 80 



Ser Thr Ala Ser Asp Pro Tyr Ser Asp Phe Qlu Lye Val Thr Gly Arg 
85 90 95 



lie Asp LyB Aon Val Ser Pro Glu Ala Arg Hie Pro lieu Val Ala Ala 
100 105 110 



Tyr Pro lie Val His Val Asp Met Qlu Asn lie He Leu Ser LyB Asn 
115 120 125 



Glu Asp Gin Ser Thr Gin Asn Thr Asp Ser Glu Thr Arg Thr He Ser 
130 135 140 



Lys Asn Thr Ser Thr Ser Arg Thr His Thr Ser Glu Val Hie Gly Asn 
X45 150 155 160 



Ala Glu Val His Ala Ser Asp He Gly Gly Ser Val Ser Ala Gly Phe 
165 170 175 



Ser Asn Ser Asn Ser Ser Thr val Ala He Asp His Ser Leu Ser Leu 
180 185 190 



Ala Gly Glu Arg Thr Trp Ala Glu Thr Met Gly Leu Asn Thr Ala Asp 
195 200 205 



Thr Ala Arg Leu Asn Ala Asn He T^g Tyr Val Asn Thr Gly Thr Ala 
210 215 220 



Pro He Tyr Asn Val Leu Pro Thr Thr Ser Leu Val Leu Gly Lys Asn 
225 230 235 240 



Gin Thr Leu Ala Thr He Lys Ala Lys Glu Asn Gin Leu Ser Gin He 
245 250 255 



Leu Ala Pro Asn Asn Tyr Tyr Pro Ser Lys Asn Leu Ala Pro He Ala 
260 265 270 



Leu Asn Ala Oln Asp Asp Phe Ser Ser Thr Pro He Thr Met Asn Tyr 
275 280 285 



Asn Gin Phe Leu Glu Leu Qlu Lys Thr Lys Gin Leu Arg Leu Asp Thr 
290 295 300 
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Asp Oln Val Tyr Gly Asn lie Ala Thr Tyr Asn Phe Qlu Aan Qly Arg 
305 310 315 320 



Val Arg Val Asp Thr Gly Ser Asn Trp Ser Glu Val Leu Pro Gin lie 
325 330 335 



Gin Glu Thr Thr Ala Arg lie He Phe Asn Gly Lys Asp Leu Aan Leu 
340 345 350 



Val Glu Arg Arg He Ala Ala Val Asn Pro Ser Asp Pro Leu Glu Thr 
355 360 365 



Thr Lys Pro Asp Met Thr Leu Lye Glu Ala Leu Lys He Ala Phe Gly 
370 375 380 



Phe Asn Glu Pro Aan Qly Asn Leu Gin Tyr Qln Gly Lys Asp He Thr 
365 330 395 400 



Glu Phe Asp Phe Asn Phe Asp Oln Gin Thr Ser Gin Asn He Lys Asn 
405 410 415 



Gin Leu Ala Glu Leu Aan Ala Thr Asn He Tyr Thr Val Leu Asp Lye 
420 425 430 



He Lys Leu Asn Ala Lys Met Asn lie Leu He Arg Asp Lys Arg Phe 
435 440 445 



His Tyr Asp Arg Asn Asn He Ala Val Gly Ala Asp Glu Ser Val Val 
450 455 460 



Lys Glu Ala His Arg Glu Val He Asn Ser Ser Thr Glu Gly Leu Leu 
465 470 475 480 



Leu Asn He Asp Lys Asp He Arg Lys He Leu Ser Gly Tyr He Val 
485 490 495 



Glu He Glu Asp Thr Glu Gly Leu Lys Glu Val He Asn Asp Arg Tyr 
500 505 5X0 



Asp Met Leu Asn He Ser Ser Leu Asrg Gin Asp Gly Lys Thr Phe He 
515 520 525 



Asp Phe Lys Lys Tyr Asn Asp Lys Leu Pro Leu Tyr He Ser Asn Pro 
530 535 540 



Asn Tyr Lys Val Asn Val Tyr Ala Val Thr Lys Glu Asn Thr He He 
545 550 555 560 
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Asn Pro Ser Glu Asn Oly Asp Thr Ser Thr Asn Gly He Lys hye He 
565 570 575 



Leu He Phe 6er Lys Lys Gly Tyr Glu He Gly 
580 585 



<210> 7 

<211> 2277 

<212> SNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic coding region for Human TPA/syntiietic 
antigen fusion protein 

<220> 

<221> CDS 

<222> (13) (2268) 

<223> 



<400> 7 

gatatcgcca cc atg gat gca atg aag aga ggg etc tgc tgt gtg ctg ctg 51 
Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu 
15 10 

Ctg tgt gga gca gtc ttc gtt teg ccc age gaa gtg aag caa gaa aat 99 
Leu Cys Gly Ala Val Phe Val Ser Pro Ser Glu Val Lys Gin Glu Asn 
15 20 25 

cga ctt ctg aac gag age gaa agt tea tea cag ggt ett etc gga tac 147 
Arg Leu Leu Asn Glu Ser Glu Ser Ser Ser Gin Gly Leu Leu Gly Tyr 
30 35 40 45 

tac ttc agt gae ttg aat ttc caa gca cca atg gtg gtg act agt age 195 
Tyr Phe Ser Asp Leu Asn Phe Gin Ala Pro Met Val Val Thr Ser Ser 
50 55 60 

ace acc ggc gat ttg age att ccc age tct gag ttg gag aac att ccc 243 
Thr Thr Gly Asp Leu Ser He Pro Ser Ser Glu Leu Glu Asn He Pro 
65 70 75 

age gaa aat cag tac ttc cag tct get ate tgg tec gga ttc att aag 291 
Ser Glu Asn Gin Tyr Phe Gin Ser Ala He Trp Ser Gly Phe He Lys 
80 85 90 

gtt aaa aag tec gae gaa tat aca ttt get acc teg gcg gat aac cat 339 
Val Lys Lys Ser Asp Glu Tyr Thr Phe Ala Thr Ser Ala Asp Asn His 
95 100 105 
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gtg aca atg tgg gtg gac gac cag gaa gtg ate a^c aag get tea aac 387 
Val Thr Met Tzp Val Asp Asp Qln Glu Val lie Aan Lys Ala Ser Asn 
110 115 120 125 

tct aat aaa ate egg etc gag aag ggg agg etc tac cag ate aaa att 435 
Ser Aan Lys He Arg Leu Glu Lys Gly Arg Leu Tyr Gin He Lys He 
130 135 140 

cag tac cag egg gaa aac cct aca gaa aaa gga etc gat ttc aag ctg 483 
Gin Tyr Gin Arg Glu Asn Pro Thr Glu Lys Gly Leu Asp Fhe Lys Leu 
. 145 150 155 

tae tgg aca gat age caa aac aag aaa gaa gtt ate age tea gac aat 531 
Tyr Trp Thr Asp Ser Gin Asn Lys Lys Glu Val He Ser Ser Asp Asn 
160 165 170 

ctg cag tta eec gag etc aag cag Eiag agt tct aat aca age get ggg 579 
Leu Gin Leu Pro Glu Leu Lys Gin Lys Ser Ser Asn Thr Ser Ala Gly 
175 100 165 

cea act gtg ccc gac aga gac aat gat gga ate cet gat agt eta gag 627 
Pro Thr Val Pro Asp Arg Asp Asn Asp Gly He Pro Asp Ser Leu Glu 
190 195 200 205 

gtt gag gga tac acg gta gat gtc aag aac aaa agg act ttt etc teg 675 
Val Glu Gly Tyr Thr Val Asp Val Lys Asn Lys Arg Thr Phe Leu Ser 
210 215 220 

cct tgg ate tea aat ate cat gag aag aag ggg ctt aec aag tac aag 723 
Pro T3:p He Ser Asn He His Olu Lys Lys Gly Leu Thr Lys Tyr Lys 
225 230 235 

tec tec ccc gag aag tgg tot ace get tec gat cca tat age gat ttc 771 
Ser Ser Pro Glu Lys Trp Ser Thr Ala Ser Asp Pro Tyr Ser Asp Phe 
240 245 250 

gag aag gtc aca ggc egg ate gat aaa aat gtg tct cea gag get aga 819 
Glu Lys Val Thr Gly Arg He Asp Lys Asn Val Ser Pro Glu Ala Arg 
255 260 265 

cac ccc ctg gta gea gee tac ccg att gta eac gtg gac atg gag aac 867 
His Pro Leu Val Ala Ala Tyr Pro He Val His Val Asp Met Olu Asn 
270 275 280 285 

ate att eta age aaa aac gag gac cag tec aca caa aac act gac tec 915 
He He Leu Ser Lys Asn Glu Asp Gin Ser Thr Gin Asn Thr Asp Ser 
290 295 300 

gag aec cgc ace ata tct aaa aac ace agt act tea agg ace cac ace 963 
Glu Thr Arg Thr He Ser Lys Asn Thr Ser Thr ser Arg Thr His Thr 
305 310 315 

tct gaa gtg cac ggc aat gog gaa gtc eat gca teg ttt ttc gat att 1011 
Ser Glu Val His Gly Asn Ala Glu Val His Ala Ser Phe Phe Asp He 
320 325 330 

ggt ggc tee gtg tea gcc ggc ttt age aat age aac tee teg acg gtt 1059 
Gly Gly Ser Val Ser Ala Gly Phe Ser Asn Ser Asn Ser Ser Thr Val 
335 340 345 

gcc att gac cac tea ctg tea tta gca ggt gag agg act tgg get gaa 1107 
Ala He Asp Bis Ser Leu Ser Leu Ala Gly Glu Arg Thr Txp Ala Glu 
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350 355 360 365 

act atg ggt ctg aat acc gcc gat acg gcc egg etc aac gca aat att 1155 
Thr Met Qly Leu Asn Thr Ala Asp Thr Ala Arg Leu Asn Ala Aan lie 
370 375 380 

egg tac gtc aac aca ggg act get cct at a tat aac gtg ctg cct acg 1203 
Arg Tyr Val Aan Thr Oly rChr Ala Pro lie Tyr Asn Val Leu Pro Thr 
385 390 395 

aca agt ctt gtc ctg ggc aaa aat cag acc etc gca acc att aag gca 1251 
Thr Ser Leu Val Leu 61y Lys Asn Gin Thr Leu Ala Thr lie Lys Ala 
400 405 410 

aag gaa aat cag ctg age cag ate etc gcc cct aac aac tat tat oca 1299 
Lys Qlu Asn Oln Leu Ser Oln He Leu Ala Fro Asn Asn Tyr Tyr Pro 
415 420 425 

tec aaa aat tta gcc ccc ata gcc ctg aac gcc cag gac gac ttt tec 1347 
Ser Lys Asn Leu Ala Pro He Ala Leu Asn Ala Gin Asp Asp Phe Ser 
430 435 440 445 

tct acc ccc ata act atg aat tac aat cag ttc ctg gag ctg gaa aag 1395 
Ser Thr Pro lie Thr Met Asn Tyr Asn Gin Phe Leu Glu Leu Glu Lys 
450 455 460 

acg aag cag ctg aga eta gac acc gat cag gtg tat gga aac ata gcg 1443 
Thr Lys Gin Leu Arg Leu Asp Thr Asp Gin Val Tyr Gly Asn He Ala 
465 470 475 

aca tat aac ttt gag aac ggc cgc gtg cgc gtc gac act ggg tea aac 1491 
Thr Tyr Asn Phe Glu Asn Gly Arg Val Arg Val Asp Thr Gly Ser Asn 
480 485 490 

tgg tct gaa gtt ctg ccg caa att caa gag aca acc gee aga att ate 1539 
Trp Ser Glu Val Leu Pro Gin He Gin Glu Thr Thr Ala Arg He He 
495 500 505 

ttt aat ggg aag gac ttg aac ctt gtc gaa egt aga att gcc gcc gtg 1587 
Phe Asn Oly Lys Asp Leu Asn Leu Val Glu Arg Arg He Ala Ala Val 
510 515 520 525 

aac ccc agt gat cca etc gag acg act aaa ccg gat atg aca ctg aaa 1635 
Asn Pro Ser Asp Pro Leu Glu Thr Thr Lys Pro Asp Met Thr Leu Lys 
530 535 540 

gag get ctg aag att gcc ttc gga ttc aac gaa cct aat ggc aat ttg 1683 
Glu Ala Leu Lys He Ala Phe Gly Phe Asn Glu Pro Asn Gly Asn Leu 
545 550 555 

cag tat cag ggg aaa gac ate aca gag ttt gat ttc aat ttc gat cag 1731 
Gin Tyr Gin Gly Lys Asp He Thr Glu Phe Asp Phe Asn Phe Asp Oln 
560 565 570 

cag act tec caa aat ate aaa aat cag ttg gca gag ctg aat gcc acc 1779 
Gin Thr Ser Gin Asn He Lys Asn Gin Leu Ala Glu Leu Asn Ala Thr 
575 580 585 

aat ate tac acg gtt etc gat aaa ate aaa ctt aac gcc aag atg aac 1627 
Asn He Tyr Thr Val Leu Asp Lys He Lys Leu Asn Ala Lys Met Asn 
590 595 600 605 
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ata ttg att cga gac aaa cgc ttc cac tac gac cgc aac aat ata gcc 1875 

lie Leu He Arg Asp Iiye Arg Pbe hIb Tyr Asp Arg Asn Asn He Ala 
610 615 620 

gta ggc get gat gag tct gtc gtc aag gag get cat agg gaa gtt ate 1923 
Val Qly Ala Asp Olu 8er Val val liys Qlu Ala His Arg Olu Val He 
625 630 635 

aac age agt act gaa ggg ctg tta ctt aat ate gae aag gac att egg 1971 
Asn Ser Ser Thr Olu Gly Leu Leu Leu Asn He Asp Lys Asp He Arg 
640 645 650 

aag ate etg tec ggg tat ate gtg gag ate gag gat ace gag ggc ctg 2019 
Lys He Leu Ser Gly Tyr He Val Olu He Qlu Asp Thr Glu Gly Leu 
655 660 665 

aag gaa gtc att aac gac cgc tat gat atg ctg aac att tec age tta 2067 
Lys Glu Val He Asn Asp Arg Tyr Asp Met Leu Asn He Ser Ser Leu 
670 675 680 685 

cga cag gac ggt aag aca ttt att gac ttt aaa aag tat aac gac aag 2115 
Arg Gin Asp Gly Lys Thr Phe He Asp Phe Lys Lys Tyr Asn Asp Lys 
690 695 700 

eta CGC ctg tac att tec aac cca aat tac eiaa gtt aat gtg tat get 2163 
Leu Pro Leu Tyr He Ser Asn Pro Asn Tyr Lys Val Asn Val Tyr Ala 
705 710 715 

gta ace aag gag aac aca ate ate aat cca age gag aac ggc gat ace 2211 
Val Thr Lys Glu Asn Thr He He Asn Pro Ser Glu Asn Gly Asp Thr 
720 725 730 

age aca aat gga ate aaa aag ate ctt ata ttt agt aaa aaa ggc tac 2259 
Ser Thr Asn Gly He Lys Lys He Leu He Phe Ser Lys Lys Gly Tyr 
735 740 745 

gag ate ggt tgaggatec 2277 

Glu He Gly 

750 



<210> 8 
<211> 752 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Human TPA/synthetic antigen fusion protein 
<400> 8 

Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly 
15 10 15 



Ala Val Phe Val Ser Pro Ser Glu Val Lys Gin Glu Asn Arg Leu Leu 
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20 



25 



30 



Aan Qlu Ser Olu Ser Ser Ser Gin Oly lieu Leu GXy Tyr Tyr Pbe Ser 
35 40 45 



Aep Leu Asn Phe Gin Ala Pro Met Val Val Thr Ser Ser Thr Thr Gly 
50 55 60 



ABp Leu Ser He Pro Ser Ser Glu Leu Glu Asn He Pro Ser Glu Asn 
65 70 75 80 



Gin Tyr Phe Gin Ser Ala He Trp Ser Gly Phe He Lye Val Lys Lys 
85 30 95 



Ser Asp Glu Tyr Thr Phe Ala Thr Ser Ala Asp Asn Hie Val Thr Met 
100 105 110 



Tip Val Asp Asp Gin Glu Val He Asn Lys Ala Ser Asn Ser Asn Lys 
115 120 125 



He Arg Leu Glu Lys Gly Arg Leu Tyr Gin He Lys He Gin Tyr Gin 
130 135 140 



Arg Glu Asn Pro Thr Glu Lye Gly lieu Asp Phe Lys Leu Tyr Trp Thr 
145 ISO 155 160 



Asp Ser Gin Asn Lys Lys Glu Val He Ser Ser Asp Asn Leu Qln Leu 
165 170 175 



Pro Glu Leu Lys Gin Lys Ser Ser Asn* Thr Ser Ala Gly Pro Thr Val 
180 1B5 190 



Pro Asp Arg Asp Asn Asp Gly lie Pro Asp Ser Leu Glu Val Glu Gly 
195 200 205 



Tyr Thr Val Asp Val Lye Asn Lys Arg Thr Phe Leu Ser Pro Trp He 
210 215 220 



Ser Asn He His Glu Lys Lys Gly Leu Thr Lys Tyr Lys Ser Ser Pro 
225 230 235 240 



Glu Lys Trp Ser Thr Ala Ser Asp Pro Tyr Ser Asp Phe Glu Lys Val 
245 250 255 



Thr Oly Arg He Asp Lys Asn Val Ser Pro Glu Ala Arg His Pro Leu 
260 265 270 
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Val Ala Ala Tyr Pro He Val His Val Asp Met Glu Asa lie Xle Leu 
275 280 285 



Ser Lys Asn Glu Asp Glu 8er Thr Qln Asn Thr Asp 8er Olu Thr Arg 
290 295 300 



Thr He Ser Lys Asn Thr Ser Thr Ser Arg Thr His ITir Ser Glu Val 
305 310 315 320 



His Gly Asn Ala Glu Val His Ala Ser Phe Phe Asp He Gly Gly Ser 
325 330 335 



Val Ser Ala Gly Phe Ser Asn Ser Asn Ser Ser Thr Val Ala He Asp 
340 345 350 



His Ser Leu Ser Leu Ala Gly Glu Arg Thr Trp Ala Glu Thr Met Gly 
355 360 365 



Leu Asn Thr Ala Asp Hbr Ala Arg Leu Asn Ala Asn He Arg Tyr Val 
370 375 360 



Asn Thr Gly Thr Ala Pro He Ty^ Asn Val Leu Pro Thr Thr Ser Leu 
385 390 395 400 



Val Leu Gly Lys Asn Gin Thr Leu Ala Thr He Lya Ala Lys Glu Asn 
405 410 415 



Gin Leu Ser Gin He Leu Ala Pro Asn Asn Tyr Tyr Pro Ser Lys Asn 
420 425 430 



Leu Ala Pro He Ala Leu Asn Ala Gin Asp Asp Phe Ser Ser Thr Pro 
435 440 445 



He Thr Net Asn Tyr Asn Gin Phe Leu Glu Leu Glu Lys Thr Lys Gin 
450 455 460 



Leu Azg Leu Asp Thr Asp Gin Val Tyr Gly Asn He Ala Thr Tyr Asn 
465 470 475 480 



Phe Glu Asn Gly Arg Val Arg Val Asp Thr Oly Ser Asn Tzp Ser Glu 
485 490 495 



Val Leu Pro Gin He Gin Glu Thr Thr Ala Arg He He Phe Asn Gly 
500 505 510 



Lys Asp Leu Asn Leu Val Glu Arg Arg He Ala Ala Val Asn Pro Ser 
515 520 525 
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Aep Pro Leu Glu Thr Hir Ijys Pro Asp Met Thr Leu Lya Glu Ala Leu 
530 535 540 



Lys lie Ala Phe Qly Phe Aan Olu Pro Asn Gly Asn Leu Gin Tyr Gin 
545 550 555 560 



Oly Lys Asp lie Thr Glu Phe Asp Phe Asn Phe Asp Gin Gin Thr Ser 
565 570 575 



Gin Asn He Lys Asn Gin Leu Ala Glu Leu Asn Ala Thr Asn He Tyr 
580 585 590 



Thr Val Leu Asp Lys He Lys Leu Asn Ala Lys Met Asn lie Leu He 
595 600 605 



Arg Asp Lys Arg Phe His Tyr Asp Arg Asn Asn He Ala Val Gly Ala 
610 615 620 



Asp Glu Ser Val Val Lys Olu Ala His Arg Glu Val He Asn Ser Ser 
625 630 635 640 



Thr Glu Gly Leu Leu Leu Asn He Asp Lys Asp He Arg Lye He Leu 
645 650 655 



Ser Oly Tyr He Val Glu He Glu Asp Thr Glu Gly Leu Lys Glu Val 
660 665 670 



He Aan Asp Arg Tyr Asp Met Leu Asn lie Ser Ser Leu Arg Gin Asp 
675 680 685 



Gly Lys Thr Phe He Asp Phe Lys Lys Tyr Asn Asp Lys Leu Pro Leu 
690 695 700 



Tyr He Ser Asn Pro Asn Tyr Lys Val Asn Val Tyr Ala Val Thr Lys 
705 710 715 720 



Glu Asn Thr He He Asn Pro Ser Glu Asn Qly Asp Thr Ser Thr Asn 
725 730 735 



Gly He Lys Lys Ha Leu He Phe Ser Lys Lys Oly Tyr Glu He Oly 
740 745 750 



<210> 9 
<211> 2418 
<212> DHA. 
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<213> Artificial Sequence 
<220> 

<223> Synthetic coding region for Human TPA/synthetlc 
antigen fusion protein 

<220> 

<221> CDS 

<222> (13) . . (2409) 

<223> 



<400> 9 

gatatcgcca cc atg gat gca atg aag aga ggg etc tgc tgt gtg ctg ctg 51 

Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu 

15 10 

ctg tgt gga gca gtc ttc gtt teg ccc age gee ggc ggg cat ggg gac 99 
Leu Cys Gly Ala Val Phe Val Ser Pro 8er Ala Gly Gly Hi a Gly Asp 
15 20 25 

gtt ggc atg cat gtg aiaa gaa aag gag aaa aac- Eiag gac gaa aac aag 147 
Val Oly Net His Val hys Glu Lys Glu Lys Asn liys Asp Glu Asn Lys 
30 35 40 45 

cgt aaa gac gaa gaa cgt aat aaa aca cag gag gaa cac tta aag gag 195 
Arg Lys Asp Olu Glu Arg Asn Lya Thr Gin Glu Glu His Leu Lys Glu 
50 55 €0 

ate atg aag cac ata gta aag att gag gta aaa ggc gaa gag get gta 243 
lie Met Lys His He Val Lys He Glu Val Lys Gly Olu Glu Ala val 
65 70 75 

aag aag gag gca gca gaa aaa ctg ttg gag aag gtg cct tct gac gtc 291 
Lys Lys Glu Ala Ala Glu Lys Leu Leu Glu Lys Val Fro Ser Asp Val 
80 85 90 

tta gag atg tat aag gee ate ggc ggt aag ate tat ate gtg gac gga 339 
Leu Glu Met Tyr Lya Ala He Gly Gly Lys He Tyr He Val Asp Gly 
95 100 105 

gac ate act aaa cac ata tct etc gaa get etc tec gag gac aag ua 387 
Asp lie Tlir Lys Hig He Ser Leu Glu Ala Leu Ser Glu Asp Lys Lys 
110 115 120 125 

aag att aaa gac ate tac ggg aag gat gee tta ttg cac gag cac tac 435 
Lys Xle Lys Asp He Tyr Gly Lys Asp Ala Leu Leu His Glu His Tyr 
130 135 140 

gtt tac gca aag gag ggc tat gag ccc gtg etc gtt att cag agt agt 483 
Val Tyr Ala Lys Glu Gly Tyr Glu Pro Val Leu Val He Gin Ser Ser 
145 150 155 

gag gac tac gtc gag aat acc gag aaa get ctg aat gtg tat tac gag 531 
Glu Asp Tyr Val Glu Asn Thr Glu Lys Ala Leu Asn Val Tyr Tyr Glu 
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160 165 170 

ate gga aag att ctg tec egg gac ate ctg tee aaa ate aae cag cca 579 
He Oly Lys He Leu ser Arg Aap He Leu Ser Lys XXe Asn Oln Pro 
175 leO 185 

tac cag aaa tte ett gat gtt ctt aae aca ate aaa aae gcg tea gat 627 
lyr Oln Lys Phe Leu Asp Val Leu Aen Thr He Lys Asn Ala Ser Asp 
190 195 200 205 

age gac ggg eag gat ctt ctg ttt aca aat caa etc aag gaa cac ccc 675 
Ser Asp Gly Qln Asp Leu Leu Phe Thr Asn Oln Leu Lys 61u His Pro 

210 215 220 

act gat tte age gtg gag ttc etc gag cag aat tet aae gaa gte cag 723 
Thr Asp Phe Ser val Qlu Phe Leu Glu Gin Asn Ser Asn Glu Val Gin 
225 230 235 

gag gtg ttc gcc aag gca ttt gcg tac tat ate gaa ccc cag cat cgc 771 
Olu Val Phe Ala Lys Ala Phe Ala Tyr Tyr He Olu Pro Gin His Arg 
240 245 250 

gat gtg etc cag ctg tac gcc ccg gag gca ttt aae tac atg gac aaa B19 
Asp Val Leu Gin Leu Tyr Ala Pro Glu Ala Phe Asn Tyr Met Asp Lys 
255 260 265 

ttc aat gaa cag gag att aat ctg tet ctg gag gaa ctg aaa gac cag 667 
Phe Asn Glu Gin Glu He Asn Leu Ser Leu Glu Glu Leu Lys Asp Gin 
270 275 280 285 

agg atg etc tec egg tat gaa aag tgg gaa aag ate aaa cag cat tac 915 
Arg Met Leu Ser Arg Tyr Glu Lys Trp Glu Lys He Lys Gin His Tyr 
290 295 300 

cag cat tgg tec gac tec ctg tea gaa gag ggg cgc gge ctg ttg aaa 963 
Gin His Tzp Ser Asp Ser Leu Ser Glu Glu Gly Arg Gly Leu Leu Lys 

305 310 315 

aag ttg cag att ccc ate gag cet aag aaa gat gat ata ata cac tet 1011 
Lys Leu Gin He Pro He Glu Pro Lys Lys Asp Asp He He His Ser 
320 325 330 

eta age cag gag gag aag gaa etc ctg aag egg ata caa ate gac tea 1059 
Leu Ser Qln Glu Glu Lys Glu Leu Leu Lys Arg He Qln He Asp Ser 
335 340 345 

tec gat ttc ctt age aca gaa gag aag gag ttt eta aaa aaa ctt cag 1107 
Ser Asp Phe Leu Ser Thr Glu Glu Lys Glu Phe Leu Lys Lys Leu Gin 
350 355 360 365 

ata gat att aga gat tea ctg age gag gaa gag aag gag ctg etc aac 1155 
He Asp He Arg Asp Ser Leu Ser Qlu Glu Glu Lys Glu Leu Leu Asn 

370 375 380 

cga att caa gte gat agt teg aac ccc ttg tea gaa aaa gag aag gaa 1203 
Arg He Gin Val Asp Ser Ser Asn Pro Leu Ser Glu Lys Glu Lys Glu 
385 390 395 

ttc ctg aaa aag ttg aag etc gac ate cag ccg tac gat att aat cag 1251 
Phe Leu Lys Lys Leu Lys Leu Asp He Gin Pro Tyr Asp He Asn Gin 
400 405 410 
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cgg eta caa gac acc ggc ggt ctg att gat age ccc age ate aac ctt 1299 
Arg Leu Gin Aep Thr Qly Oly Leu lie A6p Ser Pro Ser lie Asn Leu 
415 420 425 

gac gta egg aag caa tat aag cge gae att caa aat ate gac gee eta 1347 
Aap Val Arg Lye Oln Tyr Lye Arg Asp He Oln Hsn He Aap Ala Leu 
430 435 440 445 

tta cat caa tec ata ggc tec acg eta tac aat aaa ate tat eta tac 1395 
Leu His Oln Ser He Qly Ser Thr Leu Tyr Asn Lys He Tyr Leu Tyr 
450 455 460 

gaa aac atg aat att aac aat etc ace get aea ctg gga gcg gac ctg 1443 
GIu Asn Met Asn He Asn Asn Leu Thr Ala Thr Leu Gly Ala Asp Leu 
465 470 475 

gtc gat agt aca gae aac aea aag ata aac aga ggt att ttc aac gaa 1491 
val Asp Ser Thr Asp Asn Thr Lys He Asn Arg Oly He Phe Asn Olu 
480 485 490 

ttc aaa aag aac ttt aag tat teg ate age agt aac tat atg att gtt 1539 
Phe Lys Lys Asn Phe Lys Tyr Ser He Ser Ser Asn Ty^^ Met He Val 
495 500 505 

gac ate aat gaa egg ccc gca tta gac aat gag agg ttg aag tgg aga 1587 
Asp He Asn Olu Arg Pro Ala Leu Asp Asn Olu Arg Leu Lys Trp Arg 
510 515 520 525 

att caa ctg agt cct gat act agg gee ggc tat ctg gag aac ggg aaa 1635 
He Gin Leu Ser Pro Asp Thr Arg Ala Gly Tyr Leu Olu Asn Oly Lys 
530 535 540 

ctg ate tta cag cga aac ate ggg ctg gag ate aag gat gtg cag att 1683 
Leu lie Leu Oln Arg Asn He Gly Leu Glu He Lys Asp Val Oln He 
545 550 555 

ate aag cag age gaa aaa gaa tac att cge ate gae gee aag gtg gtg 1731 
He Lys Oln Ser Olu Lys Olu Tyr He Arg He Asp Ala Lya Val Val 
560 565 570 

cct aag tea aag ate gat ace aag ate cag gaa get cag etc aac att 1779 
Pro Lys Ser Lys He Asp Thr Lys He Oln Olu Ala Oln Leu Asn He 
575 580 585 

aac cag gag tgg aat aaa get ctt ggt ctg cca aaa tac ace aaa ctt 1827 
Asn Gin Olu Trp Asn Lys Ala Leu Oly Leu Pro Lys lyr Thr Lys Leu 
590 595 600 605 

ate acc ttt aat gtg cac aac agg tat gee tct aat ate gtc gag tea 1875 
He Thr Phe Asn Val His Asn Arg Tyr Ala Ser Asn He Val Glu Ser 
610 615 620 

gca tac ctg att etc aat gaa tgg aag aac Giat att cag tct gac ctg 1923 
Ala Tyr Leu He Leu Asn Olu Trp Lys Asn Asn lie Oln Ser Asp Leu 
625 630 635 

ate aag aag gtc acg aat tat ctg gtg gac gga aat ggc aga ttc gtg 1971 
He Lys Lys Val Thr Asn Tyr Leu Val Asp Gly Asn Gly Arg Phe Val 
640 645 650 

ttc acc gae ata act ttg cca aac att gee gag caa tac act cat cag 2019 
Phe Thr Asp He Thr Leu Pro Asn He Ala Olu Oln Tyr Thr His Oln 
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655 660 665 

gat gaa att tac gag caa gtc cac tec aaa ggt ctg tat gtt cca gag 2067 
Asp Qlu He TyT Qlu Oln Val His Ser Lys Gly X<eu Tyr Val Pro Olu 
670 675 680 685 

tea aga teg att ctg etc cat ggt cca tec aaa ggg gtt gag ctt cga 2115 
Ser Arg Ser He Leu Leu His Gly Pro Ser Lye Gly Val Glu Leu Arg 
690 695 700 

aac gat tct gag gga ttt ate get gac ttt gga gee get gtg gat gac 2163 
Asn Asp Ser Glu Gly Phe He Ala Afip Fhe Gly Ala Ala Val Aap Asp 
705 710 715 

tac gee gga tac ctg ttg gat aag aat cag tct gat etc gtg aea aat 2211 
Tyr Ala Gly Tyr Leu Leu Asp Lys Asn Gin Ser Asp Leu Val Thr Asn 
720 725 730 

age aaa aaa ttc ata gat att ttc aag gag gaa ggg agt aac ctg act 2259 
Ser Lys Lys Phe He Asp He Phe Lys Glu Glu Gly Ser Asn Leu Thr 
735 740 745 

tec tat ggc cgc acg aac gag get gaa ttt ttt geg gaa gee ttt aga 2307 
Ser Tyr Gly Arg Thr Asn Glu Ala Glu Phe Phe Ala Glu Ala Phe A2rg 
750 755 760 765 

ctt atg cac age ace gac cat get gaa agg ttg aag gtg caa aag aat 2355 
Leu Met His Ser Thr Asp His Ala Glu Arg Leu Lys Val Gin Lys Asn 
770 775 780 

gcc cct aiaa acc ttc cag ttc ata aat gac cag ate aag ttc ate ate 2403 
Ala Pro Lys Thr Phe Gin Phe He Asn Asp Gin He Lys Phe He He 
785 790 795 

aac tct tgaggatcc 2418 
Asn Ser 



<210> 10 
<211> 799 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Human TPA/ synthetic antigen fusion protein 
<400> 10 

Met Asp Ala Met Lys Arg Gly Leu Cys Cye Val Leu Leu Leu Cya Gly 
15 10 15 



Ala Val Phe Val Ser Pro Ser Ala Gly Gly His Gly Asp Val Gly Met 
20 25 30 
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His Val Lys Olu Lys Olu Lya Asn Lys Asp Glu Asn Iiys Arg Lya Asp 
35 40 45 



Glu Olu Arg Asn Lys Thr Qln Glu Olu His Leu Lys Olu lie Met Lys 
50 55 60 



Hia lie Val Lya lie Glu Val Lys Gly Glu Glu Ala Val Lys Lys Glu 
65 70 75 60 



Ala Ala Glu Lys Leu Leu Glu Lys Val Fro Ser Asp Val Leu Glu Met 
85 90 95 



Tyx Lys Ala lie Gly Gly Lys lie Tyr lie Val Asp Gly Asp lie Thr 
100 105 110 



Lys His lie Ser Leu Glu Ala Leu Ser Glu Asp Lys Lys Lys lie Lys 
115 120 125 



Asp lie Tyr Gly Lys Asp Ala Leu Leu His Olu Bis Tyr Val Tyr Ala 
130 135 140 



Lys Glu Oly Tyr Olu Pro Val Leu Val lie Gin Ser Ser Glu Asp Tyr 
145 150 155 160 



Val Glu Asn Thr Glu Lys Ala Leu Asn Val Tyr Tyr Glu lie Gly Lys 
165 170 175 



He Leu Ser Arg Asp He Leu Ser Lys rie Asn Gin Pro Tyr Gin Lys 
ISO 165 190 



Phe Leu Asp Val Leu Asn Thr He Lys Asn Ala Ser Asp Ser Asp Gly 
195 200 205 



Gin Asp Leu Leu Fhe Thr Asn Gin Leu Lys Glu His Pro Thr Asp Phe 
210 215 220 



Ser Val Olu Phe Leu Glu Gin Asn Ser Asn Glu Val Oln Olu Val Phe 
225 230 235 240 



Ala Lys Ala Phe Ala Tyr Tyr He Glu Pro Gin Hia Arg Asp Val Leu 
245 250 255 



Oln Leu Tyr Ala Pro Glu Ala Phe Asn Tyr Met Asp Lys Phe Asn Glu 
260 265 270 



Gin Olu He Asn Leu Sex Leu Glu Glu Leu Lys Asp Gin Arg Met Leu 
275 2B0 285 
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Sor Arg Tyr Qlu Lys Trp Glu Lys lie Lys Gin Hie Tyr Gin His Trp 
290 295 300 



Ser Asp Ser Leu Ser Glu Glu Gly Axrg Gly Leu Leu Lys Lys Leu Gin 
305 310 315 320 



lie Pro lie Glu Pro Lys Lys Aep Asp He He His Ser Leu Ser Gin 
325 330 335 



Glu Glu Lys Glu Leu Leu Lys Arg He Gin lie Asp Ser Ser Asp Phe 
340 345 350 



Leu Ser Thr Qlu Glu Lys Glu Phe Leu Lys Lys Leu Gin lie Asp He 
355 360 365 



Arg Asp Ser Leu Ser Glu Glu Glu Lys Glu Leu Leu Asn Arg He Gin 
370 375 380 



Val Asp Ser Ser Asn Pro Leu Ser Glu Lys Glu Lys Glu Phe Leu Lys 
385 390 395 400 



Lys Leu Lys Leu Asp He Gin Pro Tyr Asp He Asn Gin Arg Leu Gin 
405 410 415 



Asp Thr Gly Gly Leu He Asp Ser Pro Ser He Asn lieu Asp Val Arg 
420 425 430 



Lys Gin Tyr Lys Arg Asp He Gin Asn He Asp Ala Leu Leu His Gin 
435 440 445 



Ser He Gly Ser Thr Leu Tyr Asn Lys He Tyr Leu Tyr Glu Asn Miet 
450 455 460 



Asn He Asn Asn Leu Thr Ala Thr Leu Gly Ala Aep Leu Val Asp Ser 
465 470 475 480 



Thr Asp Asn Thr Lye He Asn Arg Gly He Phe Asn Qlu Phe Lys Lys 
485 490 495 



Asn Phe Lys Tyr Ser He Ser Ser Asn Tyr Net He val Asp He Asn 
500 . 505 510 



Glu Arg Pro Ala Leu Asp Asn Glu Arg Leu Lys Trp Arg He Gin Leu 
515 520 525 



Ser Pro Asp thr Arg Ala Gly Tyr Leu Glu Asn Gly Lys Leu He Leu 
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530 



535 



540 



Gin Arg Abu lie Gly Leu Glu lie Lye Asp Val Gin He He Lys Gin 
545 550 555 560 



Ser Qlu Lye Glu Tyr He Arg He Asp Ala Lys Val Val Pro Lys Ser 
565 570 575 



Lys He Asp Thr Lys He Gin Glu Ala Gin Leu Asn He Asn Gin Glu 
580 585 5^0 



Trp Asn Lys Ala Leu Gly Leu Pro Lys Tyr Thr Lys Leu He Thr Phe 
595 600 605 



Asn Val His Asn Arg Tyr Ala Ser Asn He Val Glu Ser Ala Tyr Leu 
610 615 620 



He Leu Asn Glu Trp Lys Asn Asn He Gin Ser Asp Leu He Lys Lys 
625 630 635 640 



Val Thr Asn Tyr Leu Val Asp Gly Asn Gly Arg Phe Val Phe Thr Asp 
645 650 655 



He Thr Leu Pro Asn He Ala Qlu Gin Tyr Thr His Gin Asp Glu He 
660 665 670 



Tyr Glu Gin Val His Ser Lys Gly Leu Tyr Val Pro Glu Ser Arg Ser 
675 680 685 



He Leu Leu His Gly Pro Ser Lys Gly Val Glu Leu Arg Asn Asp Ser 
690 695 700 



Glu Gly Phe He Ala Asp Phe Oly Ala Ala Val Asp Asp Tyr Ala Gly 
705 710 715 720 



Tyr Leu Leu Asp Lys Asn Gin Ser Asp Leu Val Thr Asn Ser Lys Lys 
725 730 735 



Phe He Asp He Phe Lys Glu Glu Gly Ser Asn Leu Thr Ser Tyr Gly 
740 745 750 



Arg Thr Asn Glu Ala Glu Phe Phe Ala Glu Ala Phe Arg Leu Met His 
755 760 765 



Ser Thr Asp His Ala Glu Arg Leu Lys Val Gin Lys Aan Ala Pro Lys 
770 775 780 
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Thr Phe 61n Fhe ile Asn Asp Gin Ile Lys Phe Ile Ile Asn Ser 
785 790 795 

<210> 11 

<211> 3631 

<212> DMA 

<213> Bacillus anthracis 
<220> 

<221> CDS 

<222> (685) . . (3111) 

<223> 



<400> 11 
gcatcatatg 


gcaattatat 


tcatagcgcg ttttttcccg 


ttataaaaga 


taaaaactcg 


60 


aaaacaggaa 


aataatatac 


cttttattct atacagcaac 


ctaaatatta 


taatcaactt 


120 


ttccatagaa 


attaatcctb 


ttgttataca tctttattcc 


cttaacattg 


tcaaatttca 


180 


gttattcatt 


ctggatagtc 


aataaataga ttacggttat 


gttagtattt 


ttttaaaata 


240 


atagtattaa 


atagtggaat 


gcaaatgata aatgggcttt 


aaacaaaact 


aatgaaataa 


300 


tctacaaatg 


gaatttctcc 


agttttagat taaaccatac 


caaaaaaatc 


acactgtcaa 


360 


gaaaaatgat 


agaatcccta 


cactaattaa cataaccaaa 


ttggtagtta 


taggtagaaa 


420 


cttatttatt 


tctataatac 


catgcaaaaa acaaactaaa 


tattctgttc 


catactattt 


480 


tagtaaatta 


tttagcaagt 


aaattttggt gtataaacaa 


agtttatctt 


aatataaaaa 


540 


attactttac 


ttttatacag 


attaaaatga aaaatttttt 


atgacaagaa 


atattgcctt 


600 


taatttatga 


ggaaataagt 


aaaattttct acatacttta 


ttttattgtt 


gaaatgttca 


660 


cttataaaaa 


aggagagatt 


aaat atg aat ata aaa aaa gaa ttt 
Met Asn Ile Lys Lys Glu Phe 


ata aaa 
Ile Lys 


711 



1 5 

gta att agt atg tea tgt tta gta aca gca att act ttg agt ggt ccc 759 
Val lie Ser Net Ser Cys Leu Val Thr Ala Xle Thr Leu Ser Oly Fro 
10 15 20 25 

gtc ttt ate ccc ctt gta cag ggg gcg ggc ggt eat ggt gat gta ggt 807 
Val Phe Ile Pro Leu Val Gin Gly Ala Gly Gly His Qly Aap Val Gly 
30 35 40 

atg cac gta aaa gag aaa gag aaa aat aaa gat gag aat aag aga aaa 655 
Met His Val Lys Glu Lys Glu Lys Asn Lys Asp Glu Asn Lys Arg Lys 
45 50 55 
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gat gaa gaa cga aat aaa aca cag gaa gag cat tta aag gaa ate atg 903 
Asp Glu Glu Arg Asn Lya Thr Gin Glu Glu His Leu Lya Glu 11a Met 
60 65 70 

aaa cac att gta aaa ata gaa gta aaa ggg gag gaa get gtt aaa aaa 951 
Lye His He Val Lys lie Glu Val hys Gly Glu Glu Ala Val Lya Iiye 
75 80 65 

gag gca gca gaa aag eta ctt gag aaa gta cca tct gat gtt tta gag 999 
Glu Ala Ala Glu Lys Leu Leu Glu Lys Val Pro Ser Asp Val Leu Glu 
90 95 100 105 

atg tat aaa gca att gga gga aag ata tat att gtg gat ggt gat att 1047 
Met Tyr Lye Ala He Gly Gly Lys lie Tyr He Val Asp Gly Asp He 
110 115 120 

aca aaa cat ata tct tta gaa gca tta tct gaa gat aag aaa aaa ata 1095 
Thr Lys His He Ser Leu Olu Ala Leu Ser Glu Asp Lys Lys Lys lie 
125 130 135 

aaa gac att tat ggg aaa gat get tta tta cat gaa cat tat gta tat 1143 
Lys Asp He Tyr Gly Lys Asp Ala Leu Leu His Glu His Tyr Val Tyr 
140 145 150 

gca aaa gaa gga tat gaa ccc gta ctt gta ate caa tct teg gaa gat 1191 
Ala Lys Glu Gly Tyr Glu Fro Val Leu Val He Gin Ser Ser Glu Asp 
155 160 165 

tat gta gaa Eiat act gaa aag gca ctg £iac gtt tat tat gaa ata ggt 1239 
Tyr Val Glu Asn Thr Glu Lys Ala Leu Asn Val Tyr Tyr Glu He Gly 
170 175 180 185 

aag ata tta tea agg gat att tta agt aaa att aat caa cca tat cag 1287 
Lys He Leu Ser Arg Asp He Leu Ser Lys He Asn Gin Pro Tyr Gin 
190 195 200 

aaa ttt tta gat gta tta aat ace att aaa aat gca tct gat tea gat 1335 
Lys Phe Leu Asp Val Leu Asn Thr He Lys Asn Ala Ser Asp Ser Asp 
205 210 215 

gga caa gat ctt tta ttt act aat cag ctt aag gaa cat ccc aca gac 1383 
Gly Gin Asp Leu Leu Phe Thr Asn Gin Leu Lys Glu His Pro Thr Asp 
220 225 230 

ttt tct gta gaa ttc ttg gaa caa aat age aat gag gta caa gaa gta 1431 
Phe Ser Val Glu Phe Leu Glu Gin Asn Ser Asn Glu Val Gin Glu Val 
235 240 245 

ttt gcg aaa get ttt gca tat tat ate gag cca cag cat cgt gat gtt 1479 
Phe Ala Lys Ala Phe Ala Tyr Tyr He Glu Pro Gin His Arg Asp Val 
250 255 260 265 

tta cag ctt tat gca ccg gaa get ttt aat tac atg gat aaa ttt aac 1527 
Leu Gin Leu Tyr Ala Pro Glu Ala Phe Asn Tyr Met Asp Lys Phe Asn 
270 275 280 

gaa caa gaa ata aat eta tec ttg gaa gaa ctt aaa gat caa egg atg 1575 
Glu Gin Glu He Asn Leu Ser Leu Glu Glu Leu Lys Asp Gin Arg Met 
285 290 295 

ctg tea aga tat gaa aaa tgg gaa aag ata aaa cag cac tat caa cac 1623 
Leu Ser Arg Tyr Glu Lys Trp Glu Lys He Lys Gin His Tyr Gin His 
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300 305 310 

tgg age gat tct tta tct gaa gaa gga aga gga ctt tta aaa aag ctg 1671 
Trp Ser Asp Ser Leu Ser Olu Qlu Gly Arg Qly Leu Leu Lys Lys Leu 
315 320 325 

cag att cct att gag cca aag aaa gat gac ata att cat tct tta tct 1719 
Gin lie Pro lie Glu Pro Lys Lys Asp Asp He He His Ser Leu Ser 
330 335 340 345 

caa gaa gaa aaa gag ctt eta aaa aga ata caa att gat agt agt gat 1767 
61n Glu Glu Lys Glu Leu Leu Lys Arg lie Gin He Asp Ser Ser Asp 
350 355 360 

ttt tta tct act gag gaa aaa gag ttt tta aaa aag eta caa att gat 1815 
Phe Leu Ser Thr Glu Glu Lys Glu Phe Leu Lys Lys Leu Gin lie Asp 
365 370 375 

att cgt gat tct tta tct gaa gaa gaa aaa gag ctt tta aat aga ata 1863 
He Arg Asp Ser Leu Ser Glu Glu Glu Lys Glu Leu Leu Asn Arg He 
380 385 390 

cag gtg gat agt agt aat cct tta tct gaa aaa gaa aaa gag ttt tta 1911 
Gin Val Asp Ser Ser Asn Pro Leu Ser Glu Lys Glu Lye Glu Phe Leu 
395 400 405 

aaa aag ctg aaa ctt gat att caa cca tat gat att aat caa agg ttg 1959 
Lys Lys Leu Lys Leu Asp He Gin Pro Tyr Asp Xle Asn Gin Arg Leu 
410 415 420 425 

caa gat aca gga ggg tta att gat agt ccg tea att aat ctt gat gta 2007 
Gin Asp Thr Gly Gly Leu He Asp Ser Pro Ser He Asn Leu Asp Val 

430 435 440 

aga aag cag tat rbl& agg gat att caa aat att gat get tta tta cat 2055 
Arg Lys Gin Tyr Lys Arg Asp He Gin Asn He Asp Ala Leu Leu His 
445 450 455 

caa tec att gga agt ace ttg tac aat aaa att tat ttg tat gaa aat 2103 
Gin Ser He Gly Ser Thr Leu Tyr Asn Lys He Tyr lieu Tyr Glu Asn 
460 465 470 

atg aat ate aat aac ctt aca gca acc eta ggt gcg gat tta gtt gat 2151 
Met Asn He Asn Asn Leu Thr Ala Thr Leu Gly Ala Asp Leu Val Asp 
475 400 485 

tec act gat aat act aaa att aat aga ggt att tte aat gaa ttc aaa 2199 
Ser Thr Asp Asn Thr Lys He Asn Arg Gly He Phe Asn Glu Phe Lys 
490 495 500 505 

aaa aat ttc aaa tat agt att tct agt aac tat atg att gtt gat ata 2247 
Lys Asn Phe Lys Tyr Ser He Ser Ser Asn Tyr Met He Val Asp He 
510 515 520 

aat gaa agg cct gca tta gat aat gag cgt ttg aaa tgg aga ate caa 2295 
Asn Glu Arg Pro Ala Leu Asp Asn Glu Arg Leu Lys Trp Arg He Gin 
525 530 535 

tta tea cca gat act ega gca gga tat tta gaa aat gga aag ctt ata 2343 
Leu Ser Pro Asp Thr Arg Ala Gly Tyr Leu Qlu Asn Gly Lys Leu He 
540 545 550 
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tta caa aga aac ate ggt ctg gaa ata aag gat gta caa ata att aag 2391 
lieu QXn Arg Asn lie Qly Leu Glu lie Lye Asp Val Gin lie lie Lys 
555 560 565 

caa tec gaa aaa gaa tat ata agg att gat gcg aaa gta gtg cca aag 2439 
Oln Ser Qlu Iiya Glu Tyr lie Arg lie Asp Ala Lys Val Val Pro Lys 
570 575 5B0 585 

agt aaa ata gat aca aaa att caa gaa gca cag tta aat ata aat cag 2487 
Ser Lys He Asp T2ir Lye He Gin Glu Ala Gin Leu Asn He Aan Gin 
590 595 600 

gaa tgg aat aaa gca tta ggg tta cca aaa tat aca aag ctt att aca 2535 
Glu Trp Asn Lys Ala Leu Qly Leu Pro Lys Tyr Thr Lys Leu He OHir 
605 610 615 

ttc aac gtg cat aat aga tat gca tec aat att gta gaa agt get tat 2583 
Phe Asn val His Asn Arg Tyr Ala Ser Asn He Val Glu Ser Ala Tyr 
620 625 630 

tta ata ttg aat gaa tgg aaa aat aat att oaa agt gat ctt ata aaa 2631 
Leu lie Leu Asn Glu Trp Lys Asn Asn He Gin Ser Asp Leu He Lys 
635 640 645 

aag gta aca aat tac tta gtt gat ggt aat gga aga ttt gtt ttt acc 2679 
Lys Val Thr Asn Tyr Leu Val Asp Gly Asn Gly Arg Phe Val Phe Thr 
650 655 660 665 

gat att act etc cct aat ata get gaa caa tat aca cat caa gat gag 2727 
Asp He Thr Leu Pro Asn He Ala Glu Gin Tyr Thr His Gin Asp Qlu 
670 675 680 

ata tat gag caa gtt cat tea aaa ggg tta tat gtt cca gaa tec cgt 2775 
He Tyr Glu Gin Val His Ser Lys Gly Leu Tyr Val Pro Glu Ser Arg 
685 690 695 

tct ata tta etc cat gga cct tea aaa ggt gta gaa tta agg aat gat 2823 
Ser He Leu Leu His Qly Pro Ser Lys Gly Val Qlu Leu Arg Asn Asp 
700 70S 710 

agt gag ggt ttt ata cac gaa ttt gga cat get gtg gat gat tat get 2871 
Ser Qlu Qly Phe He His Qlu Phe Gly His Ala Val Asp Asp Tyr Ala 
715 720 725 

gga tat eta tta gat aag aac caa tct gat tta gtt aca aat tct aaa 29X9 
Qly Tyr Leu Leu Asp Lys Asn Gin Ser Asp Leu Val Thr Asn Ser Lys 
730 735 740 745 

aaa ttc att gat att ttt aag gaa gaa ggg agt aat tta act teg tat 2967 
Lys Phe He Asp He Phe Lys Glu Glu Gly Ser Asn Leu Thr Ser Tyr 
750 755 760 

ggg aga aca aat gaa gcg gaa ttt ttt gca gaa gee ttt agg tta atg 3015 
Gly Arg Thr Asn Glu Ala Qlu Phe Phe Ala Glu Ala Phe Arg Leu Met 
765 770 775 

eat tct acg gac cat get gaa cgt tta aaa gtt caa aaa aat get ccg 3063 
Hie Ser Thr Asp His Ala Qlu Arg Leu Lys Val Gin Lys Asn Ala Pro 
780 785 790 

aaa act ttc caa ttt att aac gat cag att aag ttc att att aac tea 3111 
Lys Thr Phe Gin Phe He Asn Asp Gin He Lys Phe He He Asn Ser 
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AAA 












abx;aaaaaCb 


bvcaaabgga 


ccnaat-aaca 


auaauaax^aa 


uaauaauaac 


4JL / JL 


ac cage c 


auuacgaagc 


aacbaauccc 


agacbugaca 


gcaavuCwug 


ggaagcacca 






aagg&ggcau 


ugccagaaug 


acacCbuacg 


cgcccgccag 


a w acgaaggc 




aaaaacaatg 


atcctgacct 


agaacttaat 


gataatgtta 


ttaataattt 


aatgcctttt 


3351 


ataggaatat 


tagtaaaagt 


gccgEiaaaga 


tcctgttgca 


aagcttttaa 


agaacatatt 


3411 


attctatcaa 


gtggctgtat 


attttgttaa 


ttttcaataa 


attktgtaat 


taagcatacg 


3471 


tcaaaaaacc 


gaaatctgag 


ctcagatttc 


ggttttttga 


cgtatcttat 


acagatttgc 


3531 


aggttttaat 


ttcatttgct 


caatattaca 


aggatcagtt 


taaatgcaag 


tatgatgcac 


3591 


atacatgttg 


ttaaagacaa 


taagacgaaa 


aatatttctc 






3631 



<210> 12 

<211> 809 

<212> PRT 

<213> Bacillus antliraciB 



<400> 12 

Net Aen Zle Lys Lye Glu Phe lie Lya Val He 6er Met Ser Cys Leu 
15 10 15 



Val Thr Ala Ha Thr Leu Ser Gly Pro Val Phe He Pro lieu Val Gin 
20 25 30 



Gly Ala Gly Oly His Gly Asp Val Gly Ket His Val Iiys Glu Lys Glu 
35 40 45 



Lys Aen Lye Asp Glu Asn Lys Arg Lys Asp Glu Qlu Arg Asn Lys Thr 
50 55 60 



Gin Glu Glu His Leu Lys Glu He Met Lys His He Val Lys He Glu 
65 70 75 80 



Val Lys Gly Glu Glu Ala Val Lys Lys Glu Ala Ala Glu Lys Leu Leu 
85 90 95 



Glu Lys Val Pro Ser Asp Val Leu Glu Met Tyr Lys Ala He Gly Gly 
100 105 110 



Lys He Tyr He Val Asp Gly Asp He ihr Lys His He Ser Leu Glu 
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115 



120 



125 



Ala Leu Ser Glu Asp Lya Lys Lys lie Lye Asp lie Tyr 61y Lys Asp 
130 135 140 



Ala Iteu Leu His Olu His Tyr Val Tyr Ala Lys Olu Gly Tyr Glu Pro 
145 150 155 160 



Val Leu Val lie Gin Ser Ser Glu Asp Tyr Val Glu Asn Thr Glu Lys 
165 170 175 



Ala Leu Asn Val Tyr Tyr Glu lie Gly Lys lie Leu Ser Arg Asp lie 
180 185 190 



Leu Ser Lya lie Asn Gin Pro Tyr Gin Lys Phe Leu Asp Val Leu Asn 
195 200 205 



Thr lie Lys Asn Ala Ser Asp Ser Asp Gly Gin Asp Leu Leu Phe Thr 
210 215 220 



Asn Gin Leu Lys Glu His Pro Thr Asp Phe Ser Val Glu Phe Leu Glu 
225 230 235 240 



Gin Asn Ser Asn Glu Val Gin Glu Val Phe Ala Lya Ala Phe Ala Tyr 
245 250 255 



Tyr lie Glu Pro Gin His Arg Asp Val Leu Gin Leu Tyr Ala Pro Glu 
260 265 270 



Ala Phe Asn Tyr Met Asp Lys Phe Asn Glu Gin Glu lie Asn Leu Ser 
275 280 285 



Leu Glu Glu Leu Lys Asp Gin Arg Met Leu Ser Arg Tyr Glu Lys Trp 
290 295 300 



Glu Lys lie Lys Gin His Tyr Gin His Trp Ser Asp Ser Leu Ser Glu 
305 310 315 320 



Glu Gly Arg Gly Leu Leu Lys Lys Leu Gin He Pro He Glu Pro Lys 
325 330 335 



Lys Asp Asp He lie His Ser Leu Ser Gin Glu Glu Lys Glu Leu Leu 
340 345 350 



Lys Arg He Gin He Asp Ser Ser Asp Phe Leu Ser Thr Glu Glu Lys 
355 360 365 
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Olu Phe lieu Lye Xiys Leu Oln lie Asp lie Arg Asp Ser Leu Ser Olu 
370 375 360 



Olu Glu Lys Glu Leu Leu Aan Arg lie Gin Val Asp Ser Ser Abzx Pro 
385 390 395 400 



Leu Ser Glu Lyo Glu Lys Glu Phe Leu Lys Lys Leu Lys Leu Asp lie 
405 410 415 



Gin Pro Tyr Asp lie Asn Gin Arg Leu Gin Asp Thr Gly Gly Leu lie 
420 425 430 



Asp Ser Pro Ser lie Asn Leu Asp Val Arg Lys Gin Tyr Lys Arg Asp 
435 440 445 



lie Gin Asn lie Asp Ala Leu Leu His Gin Ser He Gly Ser Thr Lou 
450 455 460 



Tyr Asn Lys lie Tyr Leu Tyr Glu Asn Met Asn He Asn Asn Leu Thr 
465 470 475 480 



Ala Thr Leu Gly Ala Asp Leu Val Asp Ser Thr Asp Asn Thr Lys He 
485 490 495 



Asn Arg Gly He Phe Abu Glu Phe Lys Lys Asn Phe Lys Tyr Ser He 
500 505 510 



Ser Ser Asn Tyr Met He Val Asp He Asn Glu Arg Pro Ala Leu Asp 
515 520 525 



Asn Glu Arg Leu Lys Trp Arg He Gin Leu Ser Pro Asp Thr Arg Ala 
530 535 540 



Gly Tyr Leu Glu Asn Gly Lys Leu He Leu Gin Arg Asn He Gly Leu 

545 550 555 560 

Glu He Lys Asp Val Gin He He Lys Qln Ser Glu Lys Glu Tyr He 
565 570 575 



Arg He Asp Ala Lys Val Val Pro Lys Ser Lys He Asp Thr Lys He 
580 585 590 



Gin Glu Ala Gin Leu Asn He Asn Gin Glu Trp Asn Lys Ala Leu Gly 
595 600 605 



Leu Pro Lys Tyr Thr Lys Leu He Thr Phe Asn Val His Asn Arg Tyr 
610 615 620 
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Ala Ser Aan lie Val Glu Ser Ala Tyr lieu lie Leu Aen Glu Trp Lys 
625 630 635 640 



Asn Asn lie Gin Ser Asp Leu He Lye Lye Val Thr Asn Tyr Leu Val 
645 650 655 



Asp Gly Aan Gly Arg Fhe Val Phe Thr Asp He Thr Leu Pro Asn He 
660 665 670 



Ala Qlu.Gln Tyr Thr His Gin Asp Glu He Tyr Glu Gin Val Hie Ser 
675 680 685 



Lys Oly Leu Tyr Val Pro Glu Ser Arg Ser He Leu Leu His Gly Pro 
690 695 700 



Ser Lys Oly Val Glu Leu Arg Asn Asp Ser Glu Gly Fhe He His Glu 
705 710 715 720 



Phe Gly His Ala Val Asp Asp Tyr Ala Gly Tyr Leu Leu Asp Lys Asn 
725 730 735 



Qln Ser Asp Leu Val Thr Asn Ser Lys Lys Phe He Asp He Phe Lys 
740 745 750 



Glu Glu Gly Ser Asn Leu Thr Ser Tyr Gly Arg Thr Asn Glu Ala Glu 
755 760 765 



Phe Phe Ala Glu Ala Phe Arg Leu Met His Ser Thr Asp His Ala Glu 
770 775 780 



Arg Leu Lys Val Gin Lys Asn Ala Pro Lys Thr Phe Gin Phe He Asn 
785 790 795 800 



Asp Gin He Lys Phe He Ha Asn Ser 
605 



<210> 13 

<211> 1740 

<212> HNA 

<213> Artificial Sequence 



<220> 
<223> 



Synthetic coding region for Human TPA/B. anthracis 
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antigen fusion protein 

<220> 

<221> CDS 

<222> (13) . . (1731) 

<223> 



<400> 13 

gatatcgcca cc atg gat gca atg aag aga ggg etc tgc tgt gtg ctg ctg 51 
Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu 
15 10 

ctg tgt gga gca gtc ttc gtt teg ccc age gee ggc ggg cat ggg gac 99 
Leu Cys Oly Ala Val Phe Val Ser Pro Ser Ala Gly Qly His Oly Asp 
15 20 25 

gtt ggc atg cat gtg aaa gaa aag gag aaa aac aag gac gaa aac aag 147 
Val Gly Met His Val Lys Glu Lys Glu Lye Asn Lys Asp Glu Asn Lys 
30 35 40 45 

cgt aaa gac gaa gaa cgt aat aaa aca cag gag gaa cac tta aag gag 195 
Arg Lys Asp Glu OXu Arg Asn Lys Thr Gin Glu Glu His Leu Lys Glu 
50 55 60 

ate atg aag cac ata gta aag att gag gta aaa ggc gaa gag get gta 243 
Xle Met Lys His lie Val Lys lie Glu Val Lys Gly Glu Glu Ala Val 
fiS 70 75 

aag aag gag gca gca gaa aaa ctg ttg gag aag gtg cct tct gac gtc 291 
Lys Lys Glu Ala Ala Glu Lys Leu Leu Glu Lys Val Pro Ser Asp Val 
80 85 90 

tta gag atg tat aag gee ate ggc ggt aag ate tat ate gtg gac gga 339 
Leu Glu Met Tyr Lys Ala lie Gly Gly Lys He Tyr He Val Asp Gly 
95 100 105 

gac ate act aaa cac ata tct etc gaa get etc tec gag gac aag aaa 387 
Asp He Thr Lys His Xle Ser Leu Glu Ala Leu Ser Glu Asp Lys Lys 
110 115 120 125 

aag att aaa gac ate tac ggg stag gat gcc tta ttg cac gag cac tac 435 
Lys He Lys Asp He Tyr Gly Lys Asp Ala Leu Leu His Glu His Tyr 
130 135 140 

gtt tac gca aag gag ggc tat gag ccc gtg etc gtt att cag agt agt 483 
Val Tyr Ala Lys Glu Gly Tyr Glu Pro Val Leu Val He Gin Ser Ser 
145 150 155 

gag gac tac gtc gag aat ace gag aaa get ctg aat gtg tat tac gag 531 
Glu Asp Tyr Val Glu Asn Thr Glu Lys Ala Leu Asn Val Tyr Tyr Glu 
160 165 170 

ate gga aag att ctg tec egg gac ate ctg tec aaa ate aac cag cca 579 
He Gly Lys He Leu Ser Arg Asp He Leu Ser Lys He Asn Gin Pro 
175 180 185 

tac cag aaa ttc ctt gat gtt ctt etac aca ate eiaa aac geg tea gat 627 
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Tyx Gin Lys Fhe Leu Asp Val Leu Asn Thr He Lys Asn Ala Ser Asp 
190 195 200 205 

age gac ggg cag gat ctt ctg ttt aca aat caa etc aag gaa cac ccc 675 
Ser Asp Oly Qln Asp X«eu Leu Fhe Hut Asn Gin Leu Lys OXu His Pro 
210 215 220 

act gat ttc age gtg gag ttc etc gag cag aat tct aac gaa gtc cag 723 
Thr Asp Fhe Ser Val Glu Phe Leu Glu Gin Asn Ser Asn Glu Val Gin 
225 230 235 

gag gtg ttc gcc aag gca ttt gcg tac tat ate gaa ccc cag cat cgc 771 
Glu Val Phe Ala Lys Ala Fhe Ala Tyr Tyr He Glu Pro Gin His Arg 
240 245 250 

gat gtg etc cag ctg tac gcc ceg gag gca ttt aac tac atg gac aaa B19 
Asp Val Leu Gin Leu Tyr Ala Fro Glu Ala Phe Asn Tyr Met Asp Lys 
255 260 265 

ttc aat gaa cag gag att aat ctg tct ctg gag gaa ctg aaa gac cag 667 
Fhe Asn Glu Gin Glu He Asn Leu Ser Leu Glu Olu Leu Lys Asp Gin 
270 275 280 285 

agg atg etc tec egg tat gaa aag tgg gaa aag ate aaa cag cat tac 915 
Arg Met Leu Ser Arg Tyr Glu Lys Trp Glu Lys He Lys Gin His Tyr 
290 295 300 

cag cat tgg tec gac tec ctg tea gaa gag ggg cgc ggc ctg ttg aaa 963 
Gin His Trp Ser Asp Ser Leu Ser Glu Glu Gly Arg Gly Leu Leu Lys 
305 310 315 

aag ttg cag att ccc ate gag ect aag aaa gat gat ata ata cac tct 1011 
Lys Leu Gin He Fro lie Olu Fro Lys Lys Asp Asp He He His Ser 
320 325 330 

eta age cag gag gag aag gaa etc ctg aag egg ata caa ate gac tea 1059 
Leu ser Gin Glu Glu Lys Glu Leu Leu Lya Arg He Gin He Asp Ser 
335 340 345 

tec gat ttc ctt age aca gaa gag aag gag ttt eta aaa aaa ctt cag 1107 
Ser Asp Phe Leu Ser Thr Glu Glu Lys Glu Phe Leu Lys Lys Leu Gin 
350 355 360 365 

ata gat att aga gat tea ctg age gag gaa gag aag gag ctg etc aac 1155 
He Asp He Arg Asp Ser Leu Ser Glu Glu Glu Lys Glu Leu lieu Asn 
370 375 380 

cga att caa gtc gat agt teg aac ccc ttg tea gaa aaa gag aag gaa 1203 
Arg He Gin Val Asp Ser Ser Asn Pro Leu Ser Glu Lys Glu Lys Glu 
385 390 395 

ttc Ctg aaa aag ttg aag etc gac ate cag ceg tac gat att aat cag 1251 
Phe Leu Lys Lys Leu Lys Leu Asp He Gin Fro Tyr Asp He Asn Gin 
400 405 410 

egg eta caa gac ace ggc ggt ctg att gat age ccc age ate aac ctt 1299 
Arg Leu Gin Asp Thr Oly Gly Leu He Asp Ser Pro Ser He Asn Leu 
415 420 425 

gac gta egg aag caa tat aag cgc gac att caa aat ate gac gcc eta 1347 
Asp Val Arg Lys Gin Tyr Lys Arg Asp He Gin Asn He Asp Ala Leu 
430 435 440 445 
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tta cat caa tec at a ggc tec acg eta tac aat aaa ate tat eta tac 1395 
Leu His Qln Ser He GXy Ser Thr Leu Tyr Aen LyB lie Tyr I<eu Tyr 
450 455 460 

gaa aac atg aat att aac aat etc acc get aea ctg gga gcg gac ctg 1443 
Glu Asn Met Asn He Asn Asn Leu Thr Ala Thr Leu Qly Ala Asp Leu 
465 470 475 

gtc gat agt aca gac aac aca aag ata etac aga ggt att ttc aac gaa 1491 
Val Asp Ser Thr Asp Asn Thr Lya He Asn Arg Oly He Phe Asn Glu 
480 485 490 

ttc aaa aag aac ttt aag tat teg ate age agt aac tat atg att gtt 1539 
Phe Lys Lys Asn Phe Lys Tyr ser He Ser Ser Asn Tyr Met He Val 
495 500 505 

gac ate aat gaa egg cec gea tta gac aat gag agg ttg aag tgg aga 1567 
Asp He Asn Glu Arg Pro Ala Leu Asp Asn Glu Arg Leu Lys Trp Arg 
510 515 520 525 

att caa ctg agt cct gat act agg gee ggc tat ctg gag aac ggg aaa 1635 
He Gin Leu Ser Pro Asp Thr Arg Ala Gly Tyr Leu Glu Asn Gly Lys 
530 535 540 

ctg ate tta cag cga aac ate ggg ctg gag ate aag gat gtg eag att 1683 
Leu He Leu Gin Arg Asn He Gly Leu Glu He Lys Asp Val Gin He 
545 550 555 

ate aag cag age gaa aaa gaa tac att cgc ate gac gee aag gtg gtg 1731 
He Lys Gin Ser Glu Lys Glu Tyr He Arg He Asp Ala Lys Val Val 
560 565 570 

tagggatcc 1740 

<210> 14 

<211> 573 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Human TPA/B, anthracis antigen fusion protein 
<400> 14 

Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Qly 
15 10 15 



Ala Val Phe Val Ser Pro Ser Ala Gly Gly His Gly Asp Val Gly Met 
20 25 30 



His Val Lys Glu Lys Glu Lys Asn Lys Asp Glu Asn Lys Arg Lys Asp 
35 40 45 
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61u Glu Arg Asn Lye Thr Oln Glu Glu HIb lieu Lye Glu lie Met Lys 
50 55 60 



His lie Val Iiys lie Glu Val Lys Gly Glu Glu Ala Val Lys Lys Glu 
65 70 75 80 



Ala Ala Glu Lys Leu Leu Glu Lys Val Pro Ser Asp Val Leu Glu Met 
85 90 95 



Tyr Lye Ala lie Gly Gly Lys He Tyr lie Val Asp Gly Asp lie Thr 
100 105 110 



Lys His He Ser Leu Glu Ala Leu Ser Glu Asp Lys Lys Lys He Lys 
115 120 125 



Asp He Tyr Gly Lys Asp Ala Leu Leu His Glu His Tyr Val Tyr Ala 
130 135 140 



Lys Glu Gly Tyr Glu Pro Val Leu Val He Oln Ser Ser Glu Asp Tyr 
145 150 155 160 



Val Glu Asn Thr Glu Lys Ala Leu Asn Val Tyr Tyr Glu He Gly Lys 
165 170 175 



He Leu Ser Arg Asp He Leu Ser Lys He Asn Gin Pro Tyr Oln Lys 
180 185 190 



Phe Leu Asp Val Leu Asn Thr He Lys Asn Ala Ser Asp Ser Asp Gly 
195 200 205 



Gin Asp Leu Leu Phe Thr Asn Gin Leu Lys Glu His Pro Thr Asp Phe 
210 215 220 



Ser Val Glu Phe Leu Glu Gin Asn Ser Asn Glu Val Gin Glu Val Phe 
225 230 235 240 



Ala Lys Ala Phe Ala Tyr Tyr He Glu Pro Gin His Arg Asp Val Leu 
245 250 255 



Gin Leu Tyr Ala Pro Olu Ala Phe Asn Tyr Met Asp Lys Phe Asn Glu 
260 265 270 



Gin Glu He Asn Leu Ser Leu Glu Glu Leu Lys Asp Oln Arg Met Leu 
275 280 285 



Ser Arg Tyr Glu Lys Tip Glu Lys He Lys Gin His Tyr Oln His Trp 
290 295 300 
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8er Asp Ser Leu ser Glu Glu Gly Arg Gly Leu Leu Lye Lys Leu Gin 
305 310 315 320 



lie Pro lie Glu Pro Lye Lys Asp Asp lie lie Hie Ser Leu Ser Gin 
325 330 335 



Glu Glu Lys Glu Leu Leu Lys Arg tie Gin lie Asp Ser Ser Asp Flie 
340 345 350 



Leu Ser Thr Glu Glu Lys Glu Phe Leu Lys Lys Leu Gin lie Asp lie 
355 360 365 



Arg Asp Ser Leu Ser Glu Glu Glu Lys Glu Leu Leu Asn Arg lie Gin 
370 375 380 



Val Asp Ser Ser Asn Pro Leu Ser Glu Lys Glu Lys Glu Phe Leu Lys 
385 390 395 400 



Lys Leu Lys Leu Asp He Gin Pro Tyr Asp lie Asn Gin Arg Leu Gin 
405 410 415 



Asp Thr Gly Gly Leu lie Asp Ser Pro Ser He Asn Leu Asp Val Arg 
420 425 430 



Lys Gin Tyr Lys Arg Asp He Gin Asn He Asp Ala Leu Leu His Gin 
435 440 445 



Ser He Gly Ser Thr Leu Tyr Asn Lys He Tyr Leu Tyr Glu Asn Met 
450 455 460 



Asn He Asn Asn Leu Thr Ala Thr Leu Gly Ala Asp Leu Val Asp Ser 
465 470 475 480 



Thr Asp Asn Thr Lys He Asn Arg Gly He Phe Asn Glu Phe Lys Lys 
485 490 495 



Asn Phe Lys Tyr Ser He Ser Ser Asn Tyr Net He Val Asp He Asn 
500 505 510 



Glu Arg Pro Ala lieu Asp Asn Glu Arg Leu Lys Trp Arg He Gin Leu 
515 520 525 



Ser Pro Asp Thr Arg Ala Gly Tyr Leu Glu Asn Gly Lys Leu He Leu 
530 535 540 



Gin Arg Asn He Gly Leu Glu He Lys Asp Val Gin He He Lys Gin 
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545 550 555 560 

Ser Glu Lye Glu Tyr lie Arg He Asp Ala Lys Val Val 
565 570 



<210> 15 

<211> 753 

<212> DNA 

<213> Asrtlflclal Sequence 



<:220> 

<223> Syntiietic coding region for Human TPA/B, anthracia 
antigen fusion protein 

<220> 

<221> CDS 

<222> (13) . . (744) 

<223> 



<400> 15 

gatatcgcca cc atg gat gca atg aag aga ggg etc tgc tgt gtg ctg ctg 51 
Met Asp Ala Met Lys Arg Gly Leu Cys Cye Val Leu Leu 
15 10 

Ctg tgt gga gca gtc ttc gtt teg ccc age gee ggc ggg cat ggg gac 99 
Leu Cys Gly Ala Val Phe Val Ser Pro Ser Ala Gly Gly His Gly Asp 
15 20 25 

gtt ggc atg cat gtg aaa gaa aag gag aaa aac aag gac gaa aac aag 147 
Val Gly Met His Val Lys Glu Lys Glu Lys Asn Lys Asp Glu Asn Lys 
30 35 40 45 

cgt aaa gac gaa gaa cgt aat aaa aca cag gag gaa cac tta aag gag 195 
Arg Lys Asp Glu Qlu Arg Asn Lys Hhx Gla Glu Qlu His Leu Lys Glu 
50 55 60 

ate atg aag cac at a gta aag att gag gta aaa ggc gaa gag get gta 243 
Xle Met Lys His He Val Lys He Glu Val Lys Gly Glu Glu Ala Val 
65 70 75 

aag aag gag gca gca gaa aaa ctg ttg gag aag gtg cct tct gac gtc 291 
Lye Lys Glu Ala Ala Glu Lys Leu Leu Glu Lys Val Pro Ser Asp Val 
80 B5 90 

tta gag atg tat aag gee ate ggc ggt aag ate tat ate gtg gac gga 339 
Leu Glu Met Tyr Lys Ala He Gly Gly Lys He Tyr He Val Asp Gly 
95 100 105 

gac ate act aaa cac ata tct etc gaa get etc tec gag gac aag aaa 387 
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Asp He Thr Lys His He 8er Leu Olu Ala Leu Ser Olu Asp hye Lys 
110 115 12 0 125 

aag att aaa gac ate tac ggg aag gat gcc tta ttg cac gag cac tac 435 
Lys He Lys Asp He Tyr Gly Lya Asp Ala Leu Leu His Glu His Tyr 
130 135 140 

gtt tac gca aag gag ggc tat gag ccc gtg etc gtt att cag agt agt 483 
val Tyr Ala Lys Olu Gly Tyr Glu Pzro Val Leu Val He Gin Ser Ser 
145 150 155 

gag gac tac gtc gag aat acc gag aaa get ctg aat gtg tat tac gag 531 
Glu Asp Tyr Val Glu Asn Thr Glu Lya Ala Ijeu Asn Val Tyr Tyr Glu 
160 165 170 

ate gga aag att ctg tec egg gac ate ctg tec aaa ate aac cag cca 579 
He Gly Lys He Leu Ser Arg Asp He Leu Ser Lys He Asn Gin Pro 
175 180 185 

tac cag aaa ttc ctt gat gtt ctt aac aca ate aaa aac gcg tea gat 627 
Tyr Oln Lys Phe Leu Asp Val Leu Asn T]ir He Lys Asn Ala Ser Asp 
190 195 200 205 

age gac ggg cag gat ctt ctg ttt aca aat caa etc aag gaa cac ccc 675 
Ser Asp Gly Gin Asp Leu Leu Phs Thr Asn Gin Leu Lys Glu His Pro 
210 215 220 

act gat ttc age gtg gag ttc etc gag cag aat tct aac gEia gtc cag 723 
Thr Asp Phe Ser Val Glu Phe Leu Glu Gin Asn Ser Asn Glu Val Gin 
225 230 235 

gag gtg ttc gcc aag gca ttt tgaggatcc 753 
Glu Val Phe Ala Lys Ala Phe 
240 



<210> 16 

<211> 244 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Human TPA/B. anthracis antigen fusion protein 
<400> 16 

Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly 
15 10 15 



Ala Val Phe Val Ser Pro Ser Ala Gly Gly His Gly Asp Val Gly Met 
20 25 30 



His Val Lys Glu Lys Glu Lys Asn Lys Asp Glu Asn Lys Axg Lys Asp 
35 40 45 
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Olu Qlu Arg Asn Lys Thr Gin Olu Olu His Leu Lys Glu lie Met Lys 
50 55 60 



His lie Val Lya Xle Qlu Val Lys Gly Glu Glu Ala Val Lys Lys Glu 
65 70 75 80 



Ala Ala Glu Lys Leu Leu Glu Lys Val Pro 8er Asp Val Leu Qlu Met 
85 90 95 



Tyr Lye Ala lie Gly Gly Lys He Tyr He Val Asp Gly Asp He Thr 
100 105 110 



Lys His He Ser Leu Qlu Ala Leu 8er Glu Asp Lys Lys Lys He Lys 
115 120 125 



Asp He Tyr Gly Lys Asp Ala Leu Leu His Glu His Tyr Val Tyr Ala 
130 135 140 



Lys Glu Gly Tyr Glu Pro Val Leu Val He Gin Ser Ser Glu Asp Tyr 
145 150 155 160 



Val Glu Asn Thr Glu Lys Ala Leu Asn val Tyr Tyr Glu He Gly Lys 
165 170 175 



He Leu Ser Arg Asp He Leu Ser Lya He Asn Qln Pro Tyr Gin Lys 
180 185 190 



Phe Leu Asp Val Leu Asn Thr He Lys Asn Ala Ser Asp Ser Asp Gly 
195 200 205 



Gin Asp Leu Leu Phe Thr Asn Gin Leu Lys Glu His Pro Tlir Asp Phe 
210 215 220 



Ser Val Glu Phe Leu Qlu Qln Asn Ser Asn Glu Val Gin Glu Val Phe 
225 230 235 240 



Ala Lys Ala Phe 

<210> 17 
<211> 1788 

<2i2> mn 

<213> Artificial Sequence 
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<220> 

<223> Synthetic coding region for Human TPA/ synthetic 
antigen fusion protein 

<220> 

<221> CDS 

<222> (13) . • (1779) 

<223> 



<400> 17 

gatatcgcca cc atg gat gca atg aag aga ggg etc tgc tgt gtg ctg ctg 51 
Met Asp Ala Met Iiys Axg Gly Leu Cya Cys Val lieu l4eu 
15 10 

ctg tgt gga gca gtc ttc gtt teg ccc age age get ggg cca act gtg 
Leu Cys Oly Ala Val Phe Val Ser Pro Ser Ser Ala Qly Pro Thr Val 
15 20 25 

ccc gac aga gac aat gat gga ate cct gat agt eta gag gtt gag gga 147 
Pro Asp Arg Asp Asn Asp Oly He Pro Asp Ser Leu Glu Val Olu Qly 
30 35 40 45 

tac acg gta gat gtc aag aac aaa agg act ttt etc teg cct tgg ate 195 
Tyr Thr Val Asp Val Lys Asn Lye Arg Thr Phe Leu Ser Pro Trp He 
50 55 60 

tea aat ate cat gag aag aag ggg ctt acc aag tac aag tec tec ccc 243 
Ser Asn He His Qlu Lys Lys Gly Leu Thr Lys Tyr Lys Ser Ser Pro 
65 70 75 

gag aag tgg tct acc get tec gat cca tat age gat ttc gag aag gtc 291 
Glu Lys Trp Ser Thr Ala ser Asp Pro Tyr Ser Asp Phe Glu Lys Val 
80 85 90 

aca ggc egg ate gat aaa cag gtg tct cca gag get aga eac ccc otg 339 
Thr Gly Arg He Asp Lys Oln Val Ser Pro Glu Ala Arg His Pro Leu 
95 100 105 

gta gca gee tac ccg att gta cac gtg gac atg gag aac ate att eta 387 
Val Ala Ala Tyr Pro He Val His Val Asp Met Glu Aen He He Leu 
110 115 120 125 

age aaa aac gag gac cag tec aca eaa aac act gac tec gag acc cgc 435 
Ser Lys Asn Olu Asp Gin Ser Thr Gin Asn Thr Asp Ser Glu Thr Arg 
130 ' 135 140 

acc ata tct aaa cag acc agt act tea agg acc cac acc tct gaa gtg 483 
Thr He Ser Lys Gin Thr Ser thr Ser Arg Thr His Thr Ser Glu Val 
145 150 155 

cac ggc aat gcg gaa gtc cat gca teg ttt ttc gat att ggt ggc tec 531 
His Gly Asn Ala Glu Val His Ala Ser Phe Phe Asp He Gly Gly Ser 
160 165 170 

gtg tea gee ggc ttt age aat age cag tec teg acg gtt gee att gac 579 
val Ser Ala Gly Phe Ser Asn Ser Gin Ser Ser Thr Val Ala He Asp 
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175 180 185 

627 



675 



cac tea 
His fier 
190 


ctg 
lieu 


tea tta gca 
Ser Leu Ala 


ggt 
Gly 


gag 

Glu 


agg 
Arg 


act 
Thr 


tgg 
Trp 

200 


get 
Ala 


gaa 
Qlu 


act 
Thr 


atg 
Met 


ggt 
Gly 

205 


ctg aat 
Leu Asn 


acc 
Thr 


gcc gat acg 
Ala Asp Thr 
210 


gcc 
Ala 


egg 
Arg 


etc 
Leu 


aac 
Asn 

215 


gca 
Ala 


aat 
Aan 


att 
He 


egg 

Arg 


tac 
Tyr 

220 


gtc 
Val 


aac aca 
Asn Thr 


999 
Gly 


act get cct 
Thr Ala Pro 
225 


ata 
He 


tat 
Tyr 


aac 
Asn 
230 


gtg 
Val 


ctg 
Leu 


cct 
Pro 


acg 
Thr 


aca 
Thr 

235 


agt 
Ser 


ctt 
Leu 


gtc ctg 
Val Leu 


ggc 

Gly 
240 


aaa cag cag 
Lys Gin Gin 


ace 
Thr 


etc 
Leu 
245 


gca 
Ala 


acc 
Thr 


att 
He 


aag 
Lys 


gca 
Ala 

A S U 


aag 
Lys 


gaa 
Glu 


aat 
Asn 


cag ctg 
Gin Leu 
255 


age 
Ser 


cag ate etc 
Gin lie Leu 


gcc 
Ala 
260 


cct 
Pro 


aac 
Asn 


aac 
Aan 


tat 
Tyr 


tat 
Tyr 

265 

AOS 


cca 
Pro 


tec 
Ser 


aaa 
Lys 


aat 
Asn 


tta gcc 
Leu Ala 
270 


ccc 
Pro 


ata gcc ctg 
He Ala Leu 
275 


aac 
Asn 


gee 
Ala 


cag 
Gin 


gac 
Asp 


gac 
Asp 

2R0 


ttt 

Phe 


tec 
Ser 


tct 
Ser 


acc 
Thr 


ccc 
Pro 

2ft5 


ata act 
He Thr 


atg 
Met 


aat tac aat 
Asn Tyr Asn 


cag 
Gin 


ttc 
Phe 


ctg 
Leu 


gag 
Olu 


Ctg 
Leu 


gaa 
Glu 


aag 
Lys 


acg 
Thr 


aag 
Lys 


cag 
Gin 


ctg aga 
Leu Arg 


eta 
Leu 


gac acc gat 
Asp Thr Asp 
305 


cag 
Gin 


gtg 

Val 


tat 
Tyr 


gga 
Gly 


aac 
Asn 


ata 
lie 


gcg 
Ala 


aca 
Thr 

31 5 


tat 
Tyr 


aac 
Asn 


ttt gag 
Phe Qlu 


aac 
Asn 
320 


ggc cgc gtg 
Gly Arg Val 


cgc 
Arg 


gtc 
Val 


gac 
Asp 


act 
Thr 


ggg 

Gly 


tea 
Ser 


cag 
Gin 


tgg 
Trp 


tct 
Ser 


gaa 
Glu 


gtt ctg 
Val Leu 
335 


cog 
Pro 


caa att caa 
Gin He Gin 


gag 
Glu 


aca 
Thr 


acc 
Thr 


gcc 
Ala 


aga 
Arg 


att 
He 

■343 


ate 
He 


ttt 
Phe 


aat 
Asn 


ggg 

Gly 


aag gac 
Lys Asp 
350 


ttg 
Leu 


aac ctt gtc 
Asn Leu Val 


gaa 
Glu 


cgt 
Arg 


aga 
Arg 


att 
He 


gcc 
Ala 


gcc 
Ala 


gtg 
Val 


cag 
Gin 


ccc 
Pro 


agt 
Ser 

3o5 


gat cca 
Asp Pro 


etc 
Leu 


gag acg act 
Glu Thr Thr 
370 


aaa 
Lys 


ccg 
Pro 


gat 
Asp 


atg 
Met 
375 


aca 
Thr 


ctg 
Leu 


aaa 
Lys 


gag 
Glu 


get 
Ala 
380 


Ctg 
Leu 


aag att 
Lys lie 


gcc 
Ala 


ttc gga ttc 
Phe Gly Phe 
385 


aac 
Asn 


gaa 
Glu 


cct 
Pro 
390 


aat 
Aan 


ggc 

Gly 


aat 
Asn 


ttg 
Leu 


cag 
Gin 
395 


tat 
Tyr 


cag 
Gin 


ggg aaa 
Qly Lys 


gac 
Asp 
400 


ate aca gag 
He Thr Glu 


ttt 
Phe 


gat 
Asp 
405 


ttc 
Phe 


aat 
Asn 


ttc 
Phe 


gat 
Asp 


cag 
Gin 
410 


cag 
Gin 


act 
Thr 


tec 
Ser 


caa aat 
Qln Asn 
415 


ate 
He 


aaa aat cag 
Lys Asn Gin 


ttg 
Leu 
420 


gca 
Ala 


gag 
Glu 


ctg 
Leu 


cag 
Gin 


gcc 
Ala 
425 


acc 
Thr 


aat 
Asn 


ate 
He 


tac 
Tyr 



723 



771 



819 



867 



915 



963 



1011 



1059 



1107 



1155 



1203 



1251 



1299 
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acg gtt etc gat aaa ate aaa ctt aac gcc aag atg aac ata ttg att 1347 
thx Val Leu Asp Lya He Lys Leu Asn Ala Lys Met han He Leu He 
430 435 440 445 

cga gac aaa cgc ttc cac tac gac cgc aac aat ata gcc gta ggc get 1395 
Arg Asp Lys Arg Phe His Tyr Asp Arg Asn Asn He Ala Val Gly Ala 
450 455 460 

gat gag tct gtc gtc aag gag get cat agg gaa gtt ate cag age agt 1443 
Asp Olu 6er val val Lys Olu Ala His Arg Glu Val He Qln Ser Ser 
465 470 475 

act gaa ggg ctg tta ctt aat ate gac aag gac att egg aag ate ctg 1491 
Thr Glu Gly Leu Leu Leu Asn He Asp Lys Asp He Arg Lys He Leu 
480 485 490 

tec ggg tat ate gtg gag ate gag gat acc gag ggc ctg aag gaa gtc 1539 
Ser Gly Tyr He Val Glu He Glu Asp Thx Glu Gly Leu Lys Glu Val 
495 500 505 

att aac gac cgc tat gat atg ctg cag att tec age tta cga cag gac 1587 
He Asn Asp Arg Tyr Asp Het Leu Gin He Ser Ser Leu Arg Gin Asp 
510 515 520 525 

ggt aag aca ttt att gac ttt aaa aag tat aac gac aag eta ccc ctg 1635 
Gly Lys Thr Phe He Asp Phe Lys Lys Tyr Asn Asp Lys Leu Pro Leu 
530 535 540 

tac att tee aac cea aat tac aaa gtt aat gtg tat get gta acc aag 1683 
Tyr He Ser Asn Pro Asn Tyr Lys Val Asn val Tyr Ala Val Thr Lys 
545 550 555 

gag aac aca ate ate cag cca age gag aac ggc gat acc age aca aat 1731 
Glu Asn Thr He He Gin Pro Ser Glu Asn Gly Asp Thr Ser Thr Asn 

560 565 570 

gga ate aaa aag ate ctt ata ttt agt aaa aaa ggc tac gag ate ggt 1779 
Gly He Lys Lys He Leu He Phe Ser Lys Lys Gly Tyr Glu He Gly 
575 580 585 

tgaggatcc 1788 

<210> 18 

<211> 589 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Human TPA/aynthetic antigen fusion protein 
<400> 18 

Het Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly 
15 10 15 
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Ala Val Phe Val Ser Pro Ser Ser Ala Qly Pro Thr Val Pro Asp Arg 
20 25 30 



Asp Asn Asp Gly lie Pro Asp Ser Leu Qlu Val Qlu Sly Tyr Thr Val 
35 40 45 



Aap Val Lys Asn Lys Arg Thr Phe Leu Ser Pro Trp He Ser Asn lie 
50 55 SO 



His Qlu Lys Lys Gly Leu Thr Lys Tyr Lys Ser Ser Pro Qlu Lys Trp 
65 70 75 80 



Ser Thr Ala Ser Asp Pro Tyr Ser Asp Phe Qlu Lys Val Thr Gly Arg 
85 $0 95 



He Asp Lys Gin Val Ser Pro Glu Ala Arg His Pro Leu Val Ala Ala 
loo 105 110 



Tyr Pro He Val His Val Asp Met Glu Asn He He Leu Ser Lys Asn 
115 120 125 



Olu Asp Gin Ser Thr Qln Asn Thr Asp Ser Glu Thr Arg Thr He Ser 
130 135 140 



Lys Gin Thr Ser Thr Ser Arg Thr His Thr Ser Glu Val His Gly Asn 
145 150 155 160 



Ala Glu Val His Ala Ser Phe Phe Asp He Oly Gly Ser Val Ser Ala 
165 170 175 



Gly Phe Ser Asn Ser Gin Ser Ser Thr Val Ala He Asp His Ser Leu 
100 185 190 



Ser Leu Ala Gly Glu Arg Thr Trp Ala Qlu Thr Met Gly Leu Asn Thr 
195 200 205 



Ala Asp Thr Ala A3rg Leu Asn Ala Asn He Arg Tyr Val Asn Thr Gly 
210 215 220 



Thr Ala Pro He Tyr Asn Val Leu Pro Thr Thr Ser Leu Val Leu Gly 
225 230 235 240 



Lys Gin Gin Thr Leu Ala Thr He Lys Ala Lys Glu Asn Gin Leu Ser 
245 250 255 



Gin He Leu Ala Pro Asn Asn Tyr Tyr Pro Ser Lys Asn Leu Ala Pro 
260 265 270 
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Ilo Ala Leu Asa Ala Gin Asp Asp Phe Ser Ser Thr Pro He Thr Met 
275 280 285 



Abxx Tyr Asn Oln Phe Leu Qlu Leu Qlu Lys Thr Lys Gin Leu 2\rg Leu 
290 295 300 



Asp Thr Asp Gin Val Tyr Qly Asn He Ala Thr Tyr Asn Phe Qlu Asn 
305 310 315 320 



Gly Arg Val Arg Val Asp Hhx Qly Ser Gin Trp Ser Glu Val Leu Pro 
325 330 335 



Gin He Gin Glu Thr Thr Ala Arg He He Phe Asn Gly Lys Asp Leu 
340 345 350 



Asn Leu Val Glu Arg Arg He Ala Ala Val Gin Pro Ser Asp Pro Leu 
355 360 365 



Qlu Thr Thr Lys Pro Asp Met Thr Leu Lys Glu Ala Leu Lys He Ala 
370 375 380 



Phe Gly Phe Asn Glu Pro Asn Qly Asn Leu Gin Tyr Gin Gly Lys Asp 
385 390 395 400 



He Thr Glu Phe Asp Phe Asn Phe Asp Gin Gin Thr Ser Gin Asn He 
405 410 415 



Lys Asn Gin Leu Ala Glu Leu Gin Ala Thr Asn He Tyr Thr val Leu 
420 425 430 



Asp Lys He Lys Leu Asn Ala Lys Met Asn He Leu He Arg Asp Lys 
435 440 445 



Arg Phe His Tyr Asp Arg Asn Asn He Ala Val Gly Ala Asp Qlu Ser 
450 455 460 



Val Val Lys Glu Ala Bis Arg Glu Val He Gin Ser Ser Thr Glu Gly 
465 470 475 4B0 



Leu Leu Leu Asn He Asp Lys Asp He Arg Lys He Leu Ser Gly Tyr 
485 490 495 



He Val Glu He Glu Asp Thr Glu Gly Leu Lys Glu Val He Asn Asp 
500 505 510 



Arg Tyr Asp Met Leu Gin He Ser Ser Leu Arg Gin Asp Gly Lys Thr 
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515 520 525 



Phe lie Asp Phe Lys Lys Tyr Han Asp Lys Leu Pro Leu Tyr lie Ser 
530 535 540 



Asn Pro Asn Tyr Lys Val Asn Val Tyr Ala Val Thr Lys Olu Asn Olir 
545 550 555 560 



lie lie Gin Pro Ser Glu Asn Oly Asp Thr Ser Tbr Asn Gly He Lys 
565 570 575 



Lys He Leu He Phe Ser Lys Lys Oly Tyr Glu He Gly 
580 585 



<210> 19 

<211> 2418 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic coding region for Human TPA/ synthetic 
antigen fusion protein 

<220> 

<221> CDS 

<222> (13) . . (2409) 

<223> 



<400> 19 

gatatcgcca cc atg gat gca atg aag aga ggg etc tgc tgt gtg ctg ctg 51 
Net Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu 
15 10 

ctg tgt gga gca gtc ttc gtt teg ccc age gcc ggc ggg cat ggg gac 99 
Leu Cys Gly Ala Val Phe Val Ser Pro Ser Ala Gly Gly His Gly Asp 
15 20 25 

9tt ggc atg cat gtg aaa gaa aag gag aaia aac aag gac gaa aac aag 147 
Val Gly Met His Val Lya Glu Lys Glu Lye Asn Lys Asp Glu Asn Lys 
30 35 40 45 

cgt aaa gac gaa gaa cgt cag aaa aca cag gag gaa cac tta aag gag 195 
Arg Lys Asp Glu Glu Arg Gin Lys Thr Gin Glu Glu His Leu Lys Olu 
50 55 60 

ate atg aag cac ata gta aag att gag gta aaa ggc gaa gag get gta 243 
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Ile Met Lys His lie Val Lys lie Glu Val Lys Gly Glu Glu Ala Val 
65 70 75 

aag aag gag gca gca gaa aaa ctg ttg gag aag gtg cct tct gac gtc 291 
Lys Lys Glu Ala Ala Glu Iiya Leu Leu Glu Lys Val Pro Ser Asp Val 
eO 85 90 

tta gag atg tat aag gcc ate ggc ggt aag ate tat ate gtg gac gga 339 
Leu Glu Met Tyr Lys Ala Xle Gly Gly Lye He lyr lie Val Asp Gly 
95 100 105 

gac ate act aaa cac ata tct etc gaa get etc tec gag gac aag aaa 387 
Asp He Thr Lys His He Ser Leu Glu Ala Leu Ser Glu Asp Lys Lys 
110 115 120 125 

aag att aaa gac ate tac ggg aag gat gee tta ttg cac gag cac tac 435 
Lys He Lys Asp He Tyr Gly Lys Asp Ala Leu Leu His Glu His Tyr 
130 135 140 

gtt tac gca aag gag ggc tat gag ccc gtg etc gtt att cag agt agt 483 
Val Tyr Ala Lys Glu Gly Tyr Glu Pro Val Leu Val He Gin Ser Ser 
145 150 155 

gag gac tac gtc gag aat ace gag aaa get ctg aat gtg tat tac gag 531 
Glu Asp Tyr Val Glu Asn Thr Glu Lys Ala Leu Asn Val Tyr Tyr Olu 
160 165 170 

ate gga aag att ctg tec egg gac ate ctg tec aaa ate aac cag cca 579 
He Gly Lys He Leu Ser Arg Asp He Leu Ser Lys He Asn Gin Pro 
175 180 185 

tac cag aaa ttc ett gat gtt ctt aac aca ate aaa cag gcg tea gat 627 
Tyi- Gin Lys Phe Leu Asp Val Leu Asn Thr He Lys Gin Ala Ser Asp 
190 195 200 205 

age gac ggg cag gat ctt ctg ttt aca aat caa etc aag gaa cac ccc 675 
Ser Asp Gly Gin Asp Leu Leu Phe Thr Asn Gin Leu Lys Glu His Pro 
210 215 220 

act gat ttc age gtg gag ttc etc gag cag aat tct aac gaa gtc cag 723 
Thr Asp Phe Ser Val Glu Phe Leu Glu Gin Asn Ser Asn Glu Val Gin 
225 230 235 

gag gtg ttc gcc aag gca ttt gcg tac tat ate gaa ccc cag cat cgc 771 
Glu Val Phe Ala Lys Ala Phe Ala Tyr Tyr He Glu Pro Gin His Arg 
240 245 250 

gat gtg etc cag ctg tac gcc ccg gag gca ttt aac tac atg gac aaa 819 
Asp Val Leu Gin Leu Tyr Ala Pro Glu Ala Phe Asn Tyr Met Asp Lys 
255 260 265 

ttc aat gaa cag gag att cag ctg tct ctg gag gaa ctg aaa gac cag 867 
Phe Asn Glu Gin Glu He Gin Leu Ser Leu Glu Glu Leu Lye Asp Gin 
270 275 280 265 

agg atg etc tec egg tat gaa aag tgg gaa aag ate aaa cag cat tac 915 
Arg Met Leu Ser Arg Tyr Glu Lys Trp Glu Lys He Lys Gin His Tyr 
290 295 300 

cag cat tgg tec gac tec ctg tea gaa gag ggg cgc ggc ctg ttg aaa 963 
Gin His Trp Ser Asp Ser Leu Ser Glu Glu Gly Arg Gly Leu Leu Lys 
305 310 315 



yfO 2004/024067 



PCTAJS2003/028199 



-57- 

aag ttg cag att ccc ate gag cct aag aaa gat gat ata ata cac tct 3.011 
Lys Leu 61n He Pro He Olu Pro Lys Iiys Asp Asp He He His Ser 
320 325 330 

eta age cag gag gag aag gaa etc ctg aag egg ata caa ate gae tea 1059 
Leu Ser Gla Olu Glu Lys Glu Leu Leu Lys Arg He Oln He Asp Ser 
335 340 345 

tec gat ttc ctt age aca gaa gag aag gag ttt cta aaa aaa ctt cag 1107 
Ser Asp Phe Leu Ser Thr Glu Glu Lys Glu Phe Leu Lys Lys lieu Oln 
350 355 360 365 

ata gat att aga gat tea ctg age gag gaa gag aag gag etg etc aae 1155 
He Asp He Arg Asp Ser Leu Ser Olu Glu Glu Lys Olu Leu Leu Asu 
370 375 380 

cga att caa gte gat agt teg aae eec ttg tea gaa aaa gag aag gaa 1203 
Arg He Oln Val Asp Ser Ser Asn Pro Leu Ser Glu Lys Olu Lys Glu 
365 390 395 

ttc etg aaa aag ttg aag etc gac ate cag ccg tac gat att aat cag 1251 
Phe Leu Lys Lys Leu Lys Leu Asp He Gin Pro Tyr Asp He Asa Gin 
400 405 410 

egg cta caa gac ace ggc ggt ctg att gat age ccc age ate aac ctt 1299 
Arg Leu Gin Asp Thr Gly Oly Leu He Asp Ser Pro Ser He Asn Leu 
415 420 425 

gac gta egg aag caa tat aag cgc gac att caa aat ate gac gee cta 1347 
Asp Val Arg Lys Gin Tyr Lys Arg Asp He Gin Asn He Asp Ala Leu 
430 435 440 445 

tta cat caa tec ata ggc tec acg cta tac aat aaa ate tat cta tac 1395 
Leu His Gin Ser He Gly Ser Thr Leu Tyr Asn Lys He Tyr Leu Tyr 
450 455 460 

gaa aac atg aat att aac cag etc acc get aca etg gga gcg gae etg 1443 
Glu Asn Met Asn He Asn Gin Leu Thr Ala Thr Leu Oly Ala Asp Leu 
465 470 475 

gte gat agt aca gae uc aca aag ata aac aga ggt att ttc aac gaa 1491 
Val Asp Ser Thr Asp Asn Thr Lys He Asn Arg Oly He Phe Asn Glu 
460 485 490 

ttc aaa aag aac ttt aag tat teg ate age agt aae tat atg att gtt 1539 
Phe Lys Lys Asn Phe Lys Tyr Ser He Ser Ser Asn Tyr Met He Val 
495 500 505 

gac ate aat gaa egg ccc gca tta gac aat gag agg ttg aag tgg aga 1587 
Asp He Asn Olu Arg Pro Ala Leu Asp Asn Glu Arg Leu Lys Trp Arg 
510 515 520 525 

att caa ctg agt cct gat act agg gee gge tat ctg gag aae ggg aaa 1635 
Ha Gin Leu Ser Pro Asp Thr Arg Ala Gly Tyr Leu Oiu Asn Oly Lys 
530 535 540 

ctg ate tta cag cga aae ate ggg etg gag ate aag gat gtg cag att 1683 
Leu He Leu Oln Arg Asn He Oly Leu Glu He Lys Asp Val Gin He 
545 550 555 

ate aag cag age gaa aaa gaa tac att cgc ate gac gee aag gtg gtg 1731 
He Lys Oln Ser Olu Lys Olu Tyr He Arg He Asp Ala Lys Val Val 
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560 565 570 

cct aag tea aag ate gat acc aag ate cag gaa get eag etc aae att 1779 
Pro Lys Ser Lys He Asp T!hx Lys He Qln QIu Ala Gin Leu Asn He 
575 560 585 

aac cag gag tgg aat aaa get ctt ggt ctg oca ataa tac ace aaa ctt 1827 
Asn Gin Glu Tip Asn Lys Ala Leu Gly Leu Pro Lys Tyr Thr Lys Leu 
590 595 600 605 

ate acc ttt aat gtg cac aac agg tat gee tct BLat ate gtc gag tea 1875 
He Tlir Phe Asn Val His Asn Arg Tyr Ala 6er Asn He Val Qlu Ser 
610 615 620 

gca tac ctg att etc aat gaa tgg aag aac aat att cag tct gac ctg 1923 
Ala Tyr Leu He Leu Asn Olu Trp Lys Asn Asn He Gin Ser Asp Leu 
625 630 635 

ate aag aag gtc acg aat tat ctg gtg gac gga aat ggc aga ttc gtg 1971 
He Lys Lys Val Thr Asn lyr Leu Val Asp Gly Asn Gly Arg Phe Val 
640 645 650 

ttc acc gac ata act ttg cca aae att gee gag caa tac act cat cag 2019 
Phe Thr Asp He Thr Leu Pro Asn He Ala Glu Oln Tyr Thr His Gin 
655 660 665 

gat gaa att tac gag caa gtc cac tec aaa ggt ctg tat gtt cca gag 2067 
Asp Glu He Tyr Glu Gin Val His Ser Lye Gly Leu Tyr Val Pro Glu 
670 675 680 685 

tea aga teg att ctg etc cat ggt cca tec aEia ggg gtt gag ctt ega 2115 
Ser Arg Ser He Leu Leu His Gly Pro Ser Lys Gly Val Glu Leu Arg 
690 695 700 

cag gat tct gag gga ttt ate get gac ttt gga gee get gtg gat gac 2163 
Gin Asp Ser Glu Gly Phe lie Ala Asp Phe Gly Ala Ala Val Asp Asp 
705 710 715 

tac gee gga tac ctg ttg gat aag cag cag tct gat etc gtg aca aat 2211 
Tyr Ala Gly Tyr Leu Leu Asp Lys Gin Gin Ser Asp Leu Val Thr Asn 
720 725 730 

age aaa aaa ttc ata gat att ttc aag gag gaa ggg agt cag ctg act 2259 
Ser Lys Lys Phe He Asp He Phe Lys Glu Glu Gly Ser Qln Leu Thr 
735 740 745 

tec tat ggc cgc acg aac gag get gaa ttt ttt gcg gaa gee ttt aga 2307 
Ser Tyr Gly Arg Thr Asn Glu Ala Glu Phe Phe Ala Glu Ala Phe Arg 
750 755 760 765 

ctt atg cac age acc gac cat get gaa agg ttg aag gtg caa aag aat 2355 
Leu Met His Ser Thr Asp His Ala Glu Arg Leu Lys Val Oln Lys Asn 
770 775 780 

gee cct aaa acc ttc cag ttc ata aat gac cag ate aag ttc ate ate 2403 
Ala Pro Lys Thr Phe Gin Phe He Asn Asp Oln He Lys Phe He He 
785 790 795 

aac tct tgaggatce 2418 
Asn Ser 
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<210> 20 
<2X1> 799 
<212> FRT 

<213> Artificial Sequence 



<220> 

<223> H\iman TPA/ synthetic antigen fusion protein 
<400> 20 

Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly 
15 10 15 



Ala Val Phe Val Ser Pro Ser Ala Gly Gly His Gly Asp Val Gly Met 
20 25 30 



His Val Lya Glu Lys Glu Lys Asn Lys Asp Glu Asn Lys Arg Lys Asp 
35 40 45 



Glu Glu A3rg Gin Lys Hhx Gin Glu Glu His Leu Lys Glu lie Met Lys 
50 55 60 



His lie Val Lys He Glu Val Lys Gly Glu Glu Ala Val Lya Lys Glu 
65 70 75 80 



Ala Ala Glu Lys Leu Leu Glu Lys Val Pro Ser Asp Val Leu Glu Met 
85 90 95 



Xyr Lys Ala He Gly Gly Lys He Tyr He Val Asp Gly Asp He Thx 
100 105 110 



Lys His He Ser Leu Glu Ala Leu Ser Glu Asp Lys Lys Lys He Lys 
115 120 125 



Asp He Tyr Gly Lys Asp Ala Leu Leu His Glu His Tyr Val Tyr Ala 
130 135 140 



Lys Glu Gly Tyr Glu Pro Val Leu Val He Gin Ser Ser Glu Asp Tyr 
145 150 155 160 



Val Glu Asn Thr Glu Lys Ala Leu Asn Val Tyr Tyr Glu He Gly Lys 
165 170 175 



He Leu Ser Arg Asp He Leu Ser Lys He Asn Gin Pro Tyr Gin Lys 
180 185 190 
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Phe Leu Asp Val lieu Aen iThr He hys Oln Ala 8er Aep 8er Asp Gly 
195 200 205 



Oln Asp Leu Leu Phe Thr Aan Qln Leu Lye 61u Hie Pro Thr Asp Phe 
210 215 220 



Ser Val Olu Phe Leu Glu Oln Asn Ser Asn Olu Val Gin Glu Val Phe 
225 230 235 240 



Ala Lys Ala Phe Ala Tyr Tyr He Glu Pxro Gin His Arg Asp Val Leu 
245 250 255 



Gin Leu Tyr Ala Pro Glu Ala Phe Asn Tyr Met Asp Lys Phe Asn Glu 
260 2S5 270 



Gin Glu He Gin Leu Ser Leu Olu Glu Leu Lys Asp Oln Azg Met Leu 
275 280 285 



Ser Arg Tyr Glu Lys Trp Glu Lys He Lys Gin His Tyr Gin His Trp 
290 295 300 



Ser Asp Ser Leu Ser Glu Glu Gly Arg Gly Leu Leu Lys Lys Leu Oln 
305 310 315 320 



He Pro He Glu Pro Lys Lys Asp Asp He He His Ser Leu Ser Gin 
325 330 335 



Glu Glu Lys Olu Leu Leu Lys Arg He Gin He Asp Ser Ser Asp Phe 
340 345 350 



Leu Ser Thr Olu Glu Lys Olu Phe lieu Lys Lys Leu Oln He Asp He 
355 360 365 



Arg Asp Ser Leu Ser Glu Glu Glu Lyo Glu Leu Leu Asn Arg He Gin 
370 375 360 



Val Asp Ser Ser Asn Pro Leu Ser Glu Lys Glu Lys Olu Phe Leu Lys 
385 390 395 400 



Lys Leu Lys Leu Asp He Gin Pro Tyr Asp He Asn Oln Arg Leu Gin 
405 410 415 



Asp Thr Gly Gly Leu He Asp Ser Pro Ser Xle Asn Leu Asp Val Arg 
420 425 430 



Lys Oln Tyr Lys Arg Asp He Gin Asn He Asp Ala Leu Leu His Gin 
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435 



440 



445 



Ser He Gly Ser T2xr Leu Tyr Asn Lye He Tyr Leu Tyr Glu Asn Met 
450 455 460 



Asn He Asn Oln Leu Thr Ala Thr Leu Gly Ala Aap Leu Val Asp Ser 
465 470 475 480 



Thr Asp Asn Thr Lys lie Asn Arg Gly He Phe Asn Glu Phe Lys Lys 
485 490 495 



Asn Phe Lys Tyr Ser He Ser Ser Asn Tyr Met He Val Asp He Aan 
500 505 510 



Glu Arg Pro Ala Leu Asp Asn Glu Arg Leu Lye Trp Arg He Gin Leu 
515 520 525 



Ser Pro Asp Thr Arg Ala Gly Tyr Leu Glu Asn Gly Lys Leu He Leu 
530 535 540 



Gin Arg Asn He Gly Leu Glu tie Lys Asp Val Gin He He Lys Gin 
545 550 555 560 



Ser Glu Lys Glu Tyr He Arg He Asp Ala Lys Val Val Pro Lys Ser 
565 570 575 



Lys He Asp Thr Lys He Gin Glu Ala Gin Leu Asn He Asn Gin Glu 
580 585 590 



Xrp Asn Lys Ala Leu Gly Leu Pro Lys Tyr Thr Lys Leu He Thr Phe 
5S5 600 605 



Asn Val His Asn Arg Tyr Ala Ser Asn He Val Glu Ser Ala Tyr Leu 
610 615 620 



He Leu Asn Glu Trp Lye Asn Asn He Gin Ser Asp Leu He Lys Lys 
625 630 635 640 



Val Thr Asn Tyr Leu Val Asp Gly Asn Gly Arg Phe Val Phe Thr Asp 
645 650 655 



He Thr Leu Pro Asn He Ala Glu Gin Tyr Thr His Gin Asp Glu He 
660 665 670 



Tyr Glu Gin Val Hie Ser Lys Gly Leu Tyr Val Pro Glu Ser Arg Ser 
675 680 6BS 
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Ile Leu Leu His Gly Pro Ser Lye Qly Val Olu Leu Arg Qln Asp Ser 
690 695 700 



Glu Oly Phe lie Ala Asp Pbe Gly Ala Ala Val Asp Asp Tyr Ala Gly 
705 710 715 720 



Tyr Leu Leu Asp Lys Gin Gin Ser Asp Leu Val Thr Asn Ser Lye Lys 
725 730 735 



Phe He Asp He Phe Lys Glu Glu Gly Ser Gin Leu Thr Ser Tyr Gly 
740 745 750 



Arg Thr Asn Glu Ala Glu Phe Phe Ala Glu Ala Phe Arg Leu Net His 
755 760 765 



Ser Thr Asp His Ala Glu Arg Leu Lys Val Gin Lys Asn Ala Pro Lys 
770 775 780 



Thr Phe Gin Phe He Asn Asp Gin He Lys Phe He He Asn Ser 
785 790 795 



<210> 2i 

<211> 2292 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Synthetic coding region 
<400> 21 

atgaagaaga ggaaggtgct gatccccctg atggccctga gcaccatcct ggtgagcagc 60 

accggcaacc tggaggtgat ccaggccgag gtgaagcagg agaacaggct gctgaacgag 120 

agcgagagca gcagccaggg cctgctgggc tactacttca gcgacctgaa cttccaggcc 180 

cccatggtgg tgaccagcag caccaccggc gacctgagca tccccagcag cgagctggag 240 

aacatcccca gcgagaacca gtacttccag agcgccatct ggagcggctt catcaaggtg 300 

aagaagagcg acgagtacac cttcgccacc agcgccgaca accacgtgac catgtgggtg 360 

gacgaccagg aggtgatcaa caaggccagc aacagcaaca agatcaggct ggagaagggc 420 

aggctgtacc agatcaagat ccagtaccag agggagaacc ccaccgagaa gggcctggac 480 

ttcaagctgt actggaccga cagccagaac aagaaggagg tgatcagcag cgacaacctg 540 

cagctgcGcg agctgaagca gaagagcagc aacagcagga agaagaggag caccagcgcc 600 
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ggccccaccg tgcccgacag 


ggacaacgac 


ggcatccccg 


acagcctgga 


QCrt>QO aqQQC 


660 


tacaccgtgg acgtgaagaa 


caagaggacc 


ttcctgagcc 


cctggatcag 


caacatccac 


720 


gagaagaagg gcctgaccaa 


gtacaagagc 


agccccgaga 


aGrtqaacTcac 

■•3 ■'33**3 ****** 


cgc cagcgac 


780 


ccctacagcg acttcgagaa 


QCTbaaccacrc 


aggatcgaca 


aqaacQtcraq 

•*3 


c c c CQaocicc 


640 


aggcaccccc tggtggccgc 


cbaccccatc 




acatggagaa 


catca^cctg 


900 


agcaagaacg aggaccagag 


cacccagaac 


accgacagcg 


agaccaggac 


c a t c agcaag 


960 


aacaccagca ccagcaggac 


ccacaccagc 


craorqtcrcaccT 


qcaacGTccQa 


*5 3 V* *J ^ «» w 3 W V. 


1020 


agcttcttcg acatcggcgg 


caqcQtcraQC 

w*»5j W^ V^M^W 


gccggcttca 


gcaacagcaa 


caQC acre acc 


1080 


gtggccatcg accacagcct 


aaacctaacc 




cctaGKxccaa 


«jnwwBwyyyw 


1140 


ctgaacaccg ccgacaccgc 


caggctgaac 


gccaacatca 


ciqtaccitoaa 


caccQacacc 


1200 


gcccccatct acaacgtgct 


gcccaccacc 




faciei caaaaa 


ccaoacccsta 

WWHK^MWW W%«jy 


1260 


gccaccatca aggccaagga 


Giaaccaorcta 


aoccaaatcc 


t QQC c GG Gaa 


caaGtaGtaG 

W^PW (vMW 




cccagcaaga acctggcccc 


catcgcccbg 


aacgcccagg 


acoacbb caa 


cacrcacc ccc 


1380 


atcaccatga actacaacca 


gtlicctcrcracr 


ctggagaaga 


ccaagcagct 


y ay y b«jy »W 


1440 


accgaccagg tgtacggcaa 


catcgccacc 


tacaacttcg 


aqaacoocaa 


3 y ^'ss^y sjy **y 


1500 


gacaccggca gcaactggag 




ccccagatcc 




caccaaaati c 


1560 


atcttcaacg gcaaggacct 




craaaacracrcr a 


tccrcccrccqt 


Qaaccc caac 

3 www w 


1620 


gaccccctgg agaccaccaa 


gcccgacatg 


accc^aaaQcr 


aaQCCC t qaa 


qabcqccttc 


1660 


ggcttcaacg agcccaacgg 


caacctgcag 


taccagggca 


aggacatcac 


c qacrt t cciac 


1740 


ttcaacttcg accagcagac 


cagccagaac 


abcaagaacc 


aciGlicrcrc coa 


QCfcaaaGacc 

y w jww 


1800 


accaacatct acaccgtgct 


ggacaagatc 


aagctgaacg 


ccaaga^gaa 


catcetoate 

W n kfl w># w«J ^ f w 


1660 


agggaoaaga ggttccacta 


caacaaaaac 


aacatcacccr 


wy yy wy wwy 


wy Biy Bty wy i»y 


192 0 


aboraacTQaciq cccacaQaaa 


QCftiaaticaa c 




"■y yy w V *»y w 


Cf r* f* cr a j% ^ 33 4" 
y V i^y cmva 






^* dy c y g c 




agaucgagga 


caccgagggc 


n nil n 


ctgaaggfiigg tgatcaacga 


caggtacgac 


atgctgaaca- 


tcagcagcct 


gaggcaggac 


2100 


ggcaagacct tcatcgactt 


caagaagtac 


aacgacaagc 


tgcccctgta 


catcagcaac 


2160 


cccaactaca aggtgaacgt 


gtacgccgtg 


accaaggaga 


acaccatcat 


caaccccagc 


2220 


gagaacggcg acaccagcac 


caacggcatc 


aagaagatcc 


tgatcttcag 


caagaagggc 


2280 


tacgagatcg gc 










2292 



<210> 22 
<211> 2427 
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<212> DN& 

<213> Artificial Sequence 



<220> 

<223> Synthetic coding region 
<400> 22 



atgaacatca 


agaaggagtt 


catcaaggtg 


atcagcatga gctgcctggt 


gaccgccatc 


60 


accctgagcg 


gccccgtgtt 


catccccctg 


gtgcagggcg ccggcggcca 


cggcgacgtg 


120 


ggcatgcacg 


tgaaggagaa 


ggagaagaac 


aaggacgaga acaagaggaa 


ggacgaggag 


180 


aggaacaaga 


cccaggagga 


gcacctgaag 


gagatcatga agcacatcgt 


gaagatcgag 


240 


gtgaagggcg 


aggaggccgt 


gaagaaggag 


gccgccgaga agctgctgga 


gaaggtgccc 


300 


agcgacgtgc 


tggagatgta 


caaggccatc 


ggcggcaaga tctacatcgt 


ggacggcgac 


360 


atcaccaagc 


acatcagcct 


ggaggccctg 


agcgaggaca agaagaagat 


caaggacatc 


420 


tacggcaagg 


acgccctgct 


gcacgagcac 


tacgtgtacg ccaaggaggg 


ctacgagccc 


480 


gtgctggtga 


tccagagcag 


cgaggactac 


gtggagaaca ccgagaaggc 


cctgaacgtg 


540 


tactacgaga 


tcggcaagat 


Gctgagcagg 


gacatcctga gcaagatcaa 


ccagccctac 


600 


cagaagttcc 


tggacgtgct 


gaacaccatc 


aagaacgcca gcgacagcga 


cggccaggac 


660 


ctgctgttca 


ccaaccagct 


gaaggagcac 


cccaccgact tcagcgtgga 


gttcctggag 


720 


cagaacagca 


acgaggtgca 


ggaggtgttc 


gccaaggcct tcgcctacta 


catcgagccc 


780 


cagcacaggg 


acgtgctgca 


gctgtacgcc 


cccgaggcct tcaactacat 


ggacaagttc 


840 


aacgagcagg 


agatcaacct 


gagcctggag 


gagctgaagg accagaggat 


gctgagcagg 


900 


tacgagaagt 


gggagaagat 


caagcagcac 


taccagcact ggagcgacag 


cctgagcgag 


960 


gagggcaggg 


gcctgctgaa 


gaagctgcag 


atccccatcg agcccaagaa 


ggacgacatc 


1020 


atccacagcc 


tgagccagga 


ggagaaggag 


ctgctgaaga ggatccagat 


cgacagcagc 


1080 


gacttcctga 


gcaccgagga 


gaaggagttc 


ctgaagaagc tgcagatcga 


catcagggac 


1140 


agcctgagcg 


aggaggagaa 


ggagctgctg 


aacaggatcc aggtggacag 


cagcaacccc 


1200 


ctgagcgaga 


aggagaagga 


gttcctgaag 


aagctgaagc tggacatcca 


gccctacgac 


1260 


atcaaccaga 


ggctigcagga 


caccggcggc 


ctgatcgaca gccccagcat 


caacctggac 


1320 


gtgaggaagc 


agtacaagag 


ggacatccag 


aacatcgacg ccctgctgca 


ccagagcatc 


1380 


ggcagcaccc 


tgtacaacaa 


gatctacctg 


tacgagaaca tgaacatcaa 


caacctgacc 


1440 


gccaccctgg 


gcgccgacct 


ggtggacagc 


accgacaaca ccaagatcaa 


caggggcatc 


1500 


ttcaacgagt 


tcaagaagaa 


cttcaagtac 


agcatcagca gcaactacat 


gatcgtggac 


1560 
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atcaacgaga ggcccgccct ggacaacgag aggctgaagt ggaggatcca gctgagcccc 1620 

gacaccaggg ccggctacct ggagaacggc aagctgatcc tgcagaggaa catcggcctg X€80 

gagatcaagg acgtgcagat catcaagcag agcgagaagg agtacatcag gatcgacgcc 1740 

aaggtggtgc ccaagagcaa gatcgacacc aagatccagg aggcccagct gaacatcaac 1800 

caggagtgga acaaggccct gggcctgccc aagtacacca agctgatcac cttcaacgtg 1B60 

cacaacaggt acgccagcaa catcgtggag agcgcctacc tgatcctgaa cgagtggaag 1920 

aacaacatcc agagcgacct gatcaagaag gtgaccaact acctggtgga cggcaacggc 1980 

aggttcgtgt tcaccgacat caccctgccc aacatcgccg agcagtacac ccaccaggac 2040 

gagatctacg agcaggtgca cagcaagggc ctgtacgtgc ccgagagcag gagcatcctg 2100 

ctgcacggcc ccagcaaggg cgtggagctg aggaacgaca gcgagggctt catccacgag 2160 

ttcggccacg ccgtggacga ctacgccggc tacctgctgg acaagaacca gagcgacctg 2220 

gtgaccaaca gcaagaagtt cabcgacatc ttcewiggagg agggcagcaa cctgaccagc 2280 

tacggeagga ccaacgaggc cgagttcttc gocgaggcct tcaggctgat gcacagcacc 2340 

gaccaogccg agaggctgaa ggtgcagaag aacgccccca agaccttcca gttcatcaac 2400 

gaccagatca agttcatcat caacagc 2427 

<210> 23 
<211> 2295 

<2i2> xm 

<213> Artificial Sequence 
<220> 

<223> Synthetic coding region 
<400> 23 

atgaaaaagc gtaaggtgct gatccctctc atggcgctgt caactatctt ggtgagtagc 60 

accggcaacc tggaagtgat ccaagccgaa gtgaagcaag aaaatcgact tctgaacgag 120 

agcgaaagtt catcacaggg tcttctcgga tactacttca gtgacttgaa tttccaagca 180 

ccaatggtgg tgactagtag caccaccggc gatttgagca ttcccagctc tgagttggag 240 

aacattccca gcgaaaatca gtacttccag tctgctatct ggtccggatt cattaaggtt 300 

aaaaagtccg acgaatatac atttgctacc tcggcggata accatgtgac aatgtgggtg 360 

gacgaccagg aagtgatcaa caaggcttca aactctaata aaatccggct cgagaagggg 420 

aggctctacc agatcaaaat tcagtaccag cgggaaaacc ctacagaaaa aggactcgat 480 
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ttcaagctgt actggacaga tagccaaaac siagaaagaag ttatcagctc agacaatctg 540 

cagbtacccg agctcaagca gaagagtbct aattctagga agaaaagatc tacatccgca 600 

gggccaactg tgcccgacag agacaatgat ggutccctg atagtctaga ggttgaggga 660 

tacacggtag atgtcaagaa caaaaggact tttctctcgc cttggatatc aaatatccat 720 

gagaagaagg ggcttaccaa gtacaagtcc bcccccgaga agtggtctac cgcttccgat 780 

ccatatagcg atttcgagaa ggtcacaggc cggatcgata aaaatgtgtc tccagaggct 640 

agacaccccc tggtagcagc ctacccgatt gtacacgtgg acatggagaa catcattcta 900 

agcaaaaacg aggaccagtc cacacaaaac actgactccg agacccgcac catatctaaa 960 

aacaccagta cttcaaggac ccacacctct gaagtgcacg gcaatgcgga agtccatgca 1020 

tcgtttttcg atatkggtgg atccgtgtca gccggcttta gcaatagcaa ctcctcgacg 1080 

gttgccattg accactcact gtcattagca ggtgagagga cttgggctga aactatgggt 1140 

ctgaataccg ccgatacggc ccggctcaac gcaaatattc ggtacgtcaa cacagggact 1200 

gctcctatat ataacgtgct gcctacgaca agtcttgtcc tgggcaaaaa tcagaccctc 1260 

gcaaccatta aggcaaagga aaatcagctg agccagatcc tcgcccctaa caactattat 1320 

ccatccaaaa atttagcccc catagccctg aacgcccagg acgacttttc ctctaccccc 1380 

ataactatga attacaatca gttcctggag ctggaaaaga cgaagcagct gagactagac 1440 

accgatcagg tgtatggaaa catagcgaca tataactttg agaacggccg cgtgcgcgtc 1500 

gacactgggt caaactggtc tgaagttctg ccgcaaattc aagagacaac cgccagaatt 1560 

atctttaatg ggaaggactt gaaccttgtc gaacgtagaa ttgccgccgt gaaccccagt 1620 

gatccactcg agacgactaa accggatatg acactgaaag aggctctgaa gattgccttc 1660 

ggattcaacg aacctaatgg caatttgcag tatcagggga aagacatcac agagtttgat 1740 

ttcaatttcg atcagcagac ttcccaaaat atcaaaaatc agttggcaga gctgaatgcc 1800 

accaatatct acacggttct cgataaaatc aaacttaacg ccaagabgaa catattgatt 1860 

cgagacaaac gcttccacta cgaccgcaac aatatagccg taggcgctga tgagtctgtc 1920 

gtcaaggagg ctcataggga agttatcaac agcagtactg aagggctgtt acttaatatc 1980 

gacaaggaca ttcggaagat cctgtccggg tatatcgtgg agatcgagga taccgagggc 2040 

ctgaaggeiag tcattaacga ccgctatgat atgctgaaca tttccagctt acgacaggac 2100 

ggtaagacat ttattgactt taaaaagtat aacgacaagc tacccctgta catttccaac 2160 

ccaaattaca aagttaatgt gtatgctgta accaaggaga acacaatcat caatccaagc 2220 

gagaacggcg ataccagcac aaatggaatc aaaaagatcc ttatatttag taaaaaaggc 2280 

tacgagatcg gttga 2295 
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<210> 24 
<211> 2292 
<2I2> DM2^ 

<213> Artificial Sequence 
<220> 

<223> synthetic coding region 
<400> 24 

atgaaaaaga ggaaggtgct gatccctctc atggccctgt ctaccatcct ggttagtagc 60 

acagggaacc tggaagtgat tcaggccgag gttaagcaag agaataggct gctcaacgag 120 

tcagaatctt cgtcacaggg attattgggt tactattttt cggacctgaa tttccaggcc 100 

ccaatggtcg ttacaagctc cacaacaggc gacctgtcta tccccagctc cgaattggag 240 

aacatcccta gcgagaacca atactttcaa agcgctattt ggtcaggctt cataaaagtg 300 

aagaagtctg acgaatacac gtttgcaaca tctgccgata accacgtcac tatgtgggtc 360 

gatgaccagg aagtcatcaa caaggctagt aatagcaaca aaatcagact ggagaaaggg 420 

agattgtacc agatcaagat ccagtaccaa cgggaaaacc caacagagaa gggcctcgat 480 

tttaaactgt attggactga ctctcagaat aagaaggaag tgattagcag cgacaattta 540 

caattacccg agttgaaaca gaagagctct aattcaagga aaaagagatc tacctccgcc 600 

ggaccaacag ttccagatag ggataatgat ggaatccctg actcactgga ggtcgagggt 660 

tacaccgtgg acgtgaaaaa caaacgcact ttcctatcac cctggatctc caacattcac 720 

gagaagaagg gtctgactaa gtacaaatcc agcccagaga aatggagcac cgcaagtgat 780 

ccttatagtg acttcgagaa ggtgacgggc cggatagaca agaacgtatc acccgaagct 840 

egtcatcctc tggtcgccgc ctaccctatt gtgcatgtgg acatggaaaa catcatcctg 900 

agtaagaacg aagaccagag cactcagaac accgactccg agacacgaac gatatctaag 960 

aatacatcca cctcacgcac tcataccagc gaagtgcacg gtaacgctga agtgcacgcg 1020 

tccttcttcg acatcggcgg gtccgtgtcc gctggatttt ccaactccaa ctcttcgacc 1080 

gtagctattg accacagcct gagccttgcc ggagaaagga catgggcgga gactatgggc 1140 

ctgaatacgg ctgatacggc acggctcaat gccaacatca gatacgtgaa caccgggaca 1200 

gcccctattt acaatgtgct cccaaccaca tcactcgtac tgggaaaaaa ccagacccta 1260 

gctactatta aagcgaaaga aaatcagttg tcaeagatac tggcacccaa caattattat 1320 

ccaagcaaaa acctggcacc catcgcactc aatgcgcagg atgactttag tagtacaccc 1380 

attacaatga actacaatca gttccttgag ctcgagaaga ccaagcaact gagactcgac 1440 
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^8- 




actgaccagg 


tgtatggtaa 


tatcgccacc tacaacttcg aaaacggaag ggtgcgggta 


1500 


gatacgggct 


ccBLactggtc 


ggaggtgtta cctcagatcc aggaaaccac cgcccgcatt 


1560 


attttcaacg 


gcaaggatct 


aaatctcgfct gagcgccgga tagcagctgt caatcccagt 


1620 


gatccgctcg 


aaaccaccaa 


gcccgacatg actttgaagg aagctcttaa gattgccttt 


16B0 


gggtttaacg 


agcccaatgg 


gaatctgcag taccagggca aagatattac cgagtttgat 


1740 


tttaactttg 


atcagcagac 


aagccagaac attaagaatc aattagccga gctgaatgcc 


IBOO 


actaacatct 


atactgtttt 


ggacaagatt aaacttaatg caaagatgaa tatackaata 


1660 


cgagataagc 


gcttccatta 


tgatcgaaat aacatcgcag ttggcgccga cgaatcagtt 


1320 


gtgaaggagg 


cccacagaga 


ggtcattaat tcctctacag agggtcttct tctgaatatc 


1980 


gacaaggaca 


tacgtaagat 


cctgagcgga tatattgtag aaatcgaaga tactgagggg 


2040 




bcaviCaacga 


ccgcbabgat atgcttaaca tctctagttt gcggcaagat; 


2100 


ggaaagactt 


tcattgattt 


caagaaatac aacgataetac tcccgctgta tatctccaac 


2160 


ccaaattata 


a99tgaatgt 


gtacgctgtc accaaagaga ataccattat taacccgtct 


2220 


gagaatggcg 


acacctcqac 


gaatgggata aaaaaaatcc ttabcttcag taaaaaaggc 


2280 


tacgagatcg 


99 




22S2 



<210> 25 
<211> 2292 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic coding region 
<400> 25 

atgaaaaaaa ggaaggtgtt gatccccctt atggcccttt ccaccatctt ggtatcctca 60 
accggcaacc tggaggttat tcaagccgaa gtgaagcagg aaaatagact gctgaacgaa 120 
tccgaatcta gttctcaggg tctgctgggc tactatttta gcgacctcaa tttccaggca 180 
ccaatggtcg tgacttcgag caccacaggc gacttgagca ttccctcttc cgaactcgag 240 
aacataccaa gcgagaatca gtattttcag tccgcaatct ggtcgggttt tatcaaagta 300 
aaaaagagcg acgaatacac tttcgctacg tcagccgata atcatgtgac catgtgggtg 360 
gatgaccaag aggtcatcaa taaggcgagt aactctaaca agattcgact ggaaaaggga 420 
cgcctctatc agattaagat tcagtaccag cgtgagaacc ccactgaaaa gggtctggac 480 
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tttaagctgt attggacgga 


tagtcagaat 


aaaaaggagg 

• m m ^ ^ mm ^ ^ 


tgatcagttc 


agacaacttg 


540 


caattgcctg agctgaaaca 


gaagtccagc 


aactctcgga 


agaagcgcag 


bactagogct 


600 


ggcccaacag tccccgaccg 


cgacaatgat 


gggattcccg 


attctttgga 


aQbOQaonaa 


660 


tacacagrtgg acgtgaagaa 


caagagaaca 


ttcctgagtc 


catggattag 


taababccab 


720 


gagaaaaaag gtctaaccaa 


atacaaaagc 


agcccagaga 


agtggtcaac 


agcabcggab 


7eo 


ccttactccg atttcgagaa 


agttactggc 


aggattgaca 


agaacgtatc 


bccggaggcc 


840 


aggcatcctc tcgtcgccgc 


ttacccgatc 


gtccacgtcg 


acatggagaa 


catcatcctg 


900 


agtaagaatg aggatcaaag 


cactcagaat 


actgattccg 


agacacggac 


aabcagtaag 


960 


aatacctcaa cgagcaggac 


acacacctct 


gaagttcacg 


gcaatgccga 


ggbgcacgct 


1020 


tcattcttcg atatcggagg 


atccgtgagc 


9cgggcttca 


gcaactctaa 


ctcbbccacb 


1080 


gtagcgatcg atcatagcct 


ctccctagcc 


ggagagcgga 


catgggctga 


gaccatqggg 


1140 


ttgaatactg cagatacagc 


aagactgaac 


gccaatatta 


ggtatgtgaa 

M ^# w0 


tacaggbacc 


1200 


gcccccatct acaacgtccb 


tcctaccacc 


bcactggtgt 


taggcaaaaa 


tcagaccctc 


1260 


gccaccatta aggcaaaaga 


gaatcaactc 


tcacagatac 


bggccccaaa 


caacbabbac 


1320 


ccatctaaga atttagctcc 


cattgcttta 


aacgcccagg 


acgattttag 


cbcaacgcct 


1380 


atcaccatga attataacca 


gttcctggaa 


ctggaaaaaa 


ctaagcagct 


ccgcctggac 


1440 


accgatcagg tgtatggcaa 


catcgccaca 


tacaatttcQ 


aaaatcTQcrcg 


cgbbcgggbg 


1500 


gacaccgggt ccaactggag 


tgaagtccta 


ccccaaatcc 


aggaaaccac 


tgctcgaatc 


1560 


atcttcaatg gaaaagacct 


gaatcttgtg 


gagcggcgaa 


tcgccgctgt 


gaatccttcc 


1620 


gaccctctgg aaacgacgaa 


gcccgacatg 


actttgaaag 


aggcgctaaa 


aabcgcbbtb 


1680 


ggatttaatg aaccgaacgg 


caacttacaa 


tatcaaggga 


aggacattac 


cgagbbcgac 


1740 


tttaactttg atcagcagac 


ctcgcagaac 


ataaagaatc 


agctcgctga 


gctgaacgca 


1800 


acgaatatat acaccgtcct 


ggacaaaatt 


aagcttaacg 


ccaagatgaa 


cabccbcabb 


1860 


agagacaaga gatttcacta 


cgataggaat 


aacattzgccg 


bbciaacicccTa 


baacrbctahCT 


1920 


gtgaaagagg cacaccgcga 


ggtcattaac 


tccagcactg 


aacrocrcbacb 


CTC kQaacab t 


1960 


gacaaggata ttagaaaaat 




tacatcafcfcCT 








cttaaggaag ttataaacga 


ccgttatgac 


atgttaaaca 


babcaagcct 


ccggcaggac 


2100 


ggtaagacat ttatagattt 


caagaaatac 


aacgataagc 


ttcctcbbta 


cabcbcaaat 


2160 


cccaactata aggtgaatgt 


ttatgcagta 


acaaaagaaa 


atacaattat 


baatccabcc 


2220 


gagaacggcg atacatctac 


taacgggata 


aaaaaaatcc 


tcabcbtctc 


caagaaaggc 


2280 


tacgagatag gg 










2292 
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<210> 26 
<211> 2430 
<212> Wh 

<213> Artificial Sequence 
<220> 

<223> Synthetic coding region 
<400> 26 

atgaacatca aaaaagagtt tataaaggtg attagcatga gctgcctggt cactgccatt 60 

accctgagtg gcccagtgtt tatccctctc gtccagggcg ccggcgggca tggggacgtt 120 

ggcatgcatg tgaaagaaaa ggagaaaaac aaggacgaaa acaagcgtaa agacgaagaa 180 

cgtaataaaa cacaggagga acacttaaag gagatcatga agcacatagt aaagattgag 240 

gtaaaaggcg aagaggctgt aaagaaggag gcagcagaaa aactgttgga gaaggtgcct 3 00 

tctgacgtct tagagatgta taaggccatc ggcggtaaga tctatatcgt ggacggagac 360 

atcactaaac acatatctct cgaagctctc tccgaggaca agaaaaagat taaagacatc 420 

tacgggaagg atgccttatt gcacgagcac tacgtttacg caaaggaggg ctatgagccc 480 

gtgctcgtta ttcagagtag tgaggactac gtcgagaata ccgagaaagc tcbgaatgtg 540 

tattacgaga tcggaaagat tctgtcccgg gatatcctgt ccaaaatcaa ccagccatac 600 

cagaaattcc ttgatgttct taacacaatc aaaaacgcgt cagatagcga cgggcaggat 660 

cttctgttta caaatcaact caaggaacac cccactgatt tcagcgtgga gttcctcgag 720 

cagaattcta acgaagtcca ggaggtgttc gccaaggcat ttgcgtacta tatcgaaccc 780 

cagcatcgcg atgtgctcca gctgtacgcc ccggaggcat ttaactacat ggacaaattc 840 

aatgaacagg agattaatct gtctctggag gaactgaaag accagaggat gctctcccgg 900 

tatgaaaagt gggaaaagat caaacagcat taccagcatt ggtccgactc cctgtcagaa 960 

gaggggcgcg gcctgttgaa aaagttgcag attcccatcg agcctaagaa agatgatata 1020 

atacactctc taagccagga ggagaaggaa ctoctgaagc ggatacaaat cgactcatcc 1080 

gatttcctta gcacagaaga gaaggagttt ctaaaaaaac ttcagataga tattagagat 1140 

tcactgagcg aggaagagaa ggagctgctc aaccgaattc aagtcgatag ttcgaacccc 1200 

ttgtcagaaa aagagaagga atbcctgaaa aagttgaagc tcgacatcca gccgtacgat 1260 

attaatcagc ggctacaaga caccggcggt ctgattgata gccccagcat caaccttgac 1320 

gtacggaagc aatataagcg cgacattcaa aatatcgacg ccctattaca tcaatccata 1380 

ggatccacgc tatacaataa aatctatcta tacgaaaaca tgaatattaa caatctcacc 1440 
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ggc cga cagw 


B MB^VA MOB MB 

acagacaaca 


0% B B a^WB 4* B a B 

caaagauaaa 


cagaggtatt 


1 coo 




CCuCaaguaC 


ucgaccagca 


guaaccacau 


gacrgcugac 


i C£fO 




agacaaw^a^ 




ggagaac uca 


actgagtcct 






ggagaaoggg 


B B B J < i ■ ■ ^ ^ 

aaacugaucc 


uac agcgaaa 


catcgggctg 


1 fan 
Xo90 


era GIB A Ann A^rrVnrtanxa^ 

ya^aw6aa^9 augu^cagau 


baucaagcag 


agcgaaaaag 


aat acat teg 


catcgacgcc 


^ f it A 

174 U 




gaccgacacc 


aagauccagg 


aagctcagct 


caacattaac 


1800 




*^9y ^ 9 c c a 


B B S ^ A 9 


aactcaccac 


ccrcaacgcg 


looU 








^ ^ ^ ^ n B 

ugatccccaa 


rgaacggaag 


1920 


cig wo ci u oL%j i«\^aw^ i« 


na ^ ^ B a no A n 

^aitfCaacjaa^ 


#■* B »» IB 4* ^ 

gbCacgoAww 


auCbggcgga 


cggaaacggc 


1980 


M^n^^wMUMi* u^awfc»Mc*^*a Ci 


aan^^l'nf^/^A 
aacc wu^cca 


sa nA^* fr*n/^nn 

aaca b ugc eg 


agcaatacac 


t cat cagga t 


^ Ail fl 


era a a i* ^ V a arr^aant'ji^B 


n(*MnA na* 1* i 1 

wuccaaaggc 


Cugcacgxcc 


cagagtcaag 


arcgacccrg 


2100 


ctccatggtc catccaaagg 


ggttgagctt 


cgaaacgatt 


ctgagggatt 


tatccacgag 


2160 


tttggacacg ckgtggatga 


ctacgccgga 


tacctgttgg 


ataagaatca 


gtctgatctc 


2220 


gtgacaaata gcaaaaaatt 


catagatatt 


ttcaaggagg 


Aagggagtaa 


cctgacttcc 


2280 


tatggccgca cgaacgaggc 


tgaatttttt 


gcggaagcct 


ttagacttat 


gcacagcacc 


2340 


gaccatgctg aaaggttgaa 


ggtgcaaaag 


aatgccccta 


aaaccttcca 


gttcatEuiat 


2400 


gaccagatca agttcatcat 


caactcttga 








2430 


<210> 27 












<211> 2427 












<212> im 












<213> Artificial Sequence 










<220> 












<223> Synthetic coding region 










<400> 27 

atgaatatta ziaaaggagtt tattaaggtt 


atctctatgt 


cctgcttggt 


gacagcgata 


60 


acactgtcag gaccagtgtt 


catacccctt 


gtccaggggg 


ccggcggtca 


tggcgatgta 


120 


ggtatgcatg tgaaagagaa ggaaaaaaat 


aaagacgaga 


acaagaggaa 


ggacgaggaa 


180 


aggaataaga cccaagagga gcacctgaaa 


gagatcatga 


agcatattgt 


gaaaatcgag 


240 


gtgaaggggg aagaggccgt gaaaaaagaa 


gcagctgaga 


agctgctaga 


gaaagtgcct 


300 


tctgacgtcc tcgagatgta caaagcaatc 


ggcggcaaga 


tttacattgt 


tgatggtgac 


360 
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ataacaaagc atatttctct ggaggcctta agtgaagata agaaaaaaat caaagacatt 420 

tacggaaagg acgccctcct gcacgagcat tatgtctacg ckaaagaagg ctacgaaccc 480 

gtgctcgtca tccagtcatc tgaggattat gtggagaaca ccgaaiaaagc tttgaacgtc 540 

tabtacgaaa ttgggaaaat tctgtctaga gacattctca gcaagatcaa ccagccatac 600 

caaaaattcc tagacgttct aaatacgatc aagaatgcca gtgactccga cgggcaggat 660 

ctgttgttta cgaaccagct taaggagcat cctaccgatt tttctgtcga attccttgag 720 

cagaactcca atgaagttca agaggtcttt gctaaggctt tcgcgtacta catcgagcct 780 

cagcaccggg acgtgctgca gctctacgcc cctgaagctt tcaattatat ggacaagttc 840 

aatgaacagg aaattaacct gagtttagaa gaactgaaag accaaagaat gttgtccaga 900 

tacgagaagt gggagaagat caagcagcac tatcagcact ggtccgattc ccbbagcgaa 960 

gaagggcgcg ggctgcttaa aaagctgcag attccgatcg agccgaagaa agacgatata 1020 

attcattcac tgagccagga ggaaaaggag ctcctcaaac ggatccagat cgactcgtcc 1080 

gatttcctat ccacagagga aaaggaattt cttaaaaaac tccagattga tatacgggac 1140 

tcattatctg aggaggaaaa ggaactcctg aaccggatcc aggtcgatag tagceuicccc 1200 

Gtgtcagaaa aggaaaagga gtttctcaaa aaacttaagc tggatatcca accatacgac 1260 

atcaaccagc gactgcagga tactggaggc ttgatagatt ctccctccat aaacctggac 1320 

gtgaggaagc agtataagag ggatatccag aatatcgatg ccctgctgca ccaatctatc 1380 

ggaagtactc tttacaacaa aatctatctg tatgagaaca tgaatattaa taacctgact 1440 

gctaccttgg gcgccgacct ggtggactcg acggacaaca ccaaaatcaa ccgggggatc 1500 

ttcaatgaat ttaagaaaaa tttcaagtac tccatttcca gtaattatat gatagttgat 1560 

atcaacgagc ggccagcact ggacaaogag agattaaagt ggcgaattca actgagtccc 1620 

gatacacgcg ccggttacct cgagaacggt aagttgatct tgcagcgaaa catcggactc 1680 

gagattaagg atgtacagat catcaagcag agcgagaagg agtacattcg tabcgacgct 1740 

aaagtggtac caaaaagcaa aabtgacacc aagabacagg aggcacagct gaacataaat 1800 

caagaatgga abaaagcccb cggtcbgccb aagbabacta agctaabcac cbtbaacgtg 1660 

cacaabagab atgccagcaa battgbcgag agcgcabacc baabbcbgaa cgaabggaaa X920 

aacaatatcc aaagcgacct gabaaaaaag gtgactaabb abcbcgtbga bggcaabggc 1980 

cgcbbcgttt tcaccgatat bacbctcccc aacatcgcag aacagtacac tcatcaggac 2 040 

gagatbtabg aacaggtgca cagtaagggg ctgtabgtcc ctgagagccg cbctatcctt 2100 

cttcacggac cctcaaaggg cgbagagbba aggaabgaca gcgaggggtb cabbcacgag 2160 

bbtggccacg cagbggabga bbacgcbgga babcbccbgg abaagaacca gtccgaccbg 2220 
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gtgacaaact caaagaagtt catcgatatc tttaaggagg aaggcagcaa tttgaccagc 2280 

tacggacgca caaatgaggc cgagtttttt gccgaagcgt tccgtttgat gcattcaacc 2340 

gaccacgogg aaagactgaa agtgcagaaa aacgccccaa agacattcca gtttattaac 2400 

gaccaaatca agttcataat caattcg 2427 

<210> 28 

<211> 2427 

<212> im 

<213> Artificial Sequence 



<22d> 

<223> Synthetic coding region 
<400> 28 

atgaatatta ageuiagaatk cattaaagtc attagcatga gctgcctagt caccgccatc 60 

accctctctg gcccagtgtt tattccactg gtacagggcg caggcgggca tggcgacgtg 120 

ggaatgcatg ttaaggagaa ggaaaaaaat aaagatgaaa atiaaacgcaa ggacgaagaa 180 

cggaacaaga cccaggagga gcacctgaaa gagatcatga aacacattgt gaaaatcgaa 240 

gttaaaggtg aagaggccgt gaagaaggaa gccgcggaga aactgctgga aaaggtcccg 300 

tcggacgtac ttgaaatgta caaggcaatt ggtggcaaaa tctacattgt ggacggggac 360 

attaccaagc acataagcct ggaagcactc agcgaggaca agaagaaaat aaaggacatt 420 

tacggaaagg acgctctgct ccacgagcac tatgtctacg cgaaggaggg gtacgagccc 480 

gtgttggtga tacagagttc cgaggactat gttgaaaata ctgaaaaagc cctcaacgtg 540 

tactatgaga ttggtaagat cttgtctaga gacattctca gcaagattaa ccagccctac 600 

cagaaattcc tggatgtcct gaacacgatt aagaatgcct cagacagcga tggacaggac 660 

cttctgttta ccaatcagct taaagagcac cctaccgatt tctccgtgga attccttgag 720 

cagaattcaa atgaggtgca agaggtcttc gctaaggcct ttgcctacta tatcgagccc 780 

cagcatcgag adgtgctaca gttgtatgca ccagaagcct ttaactacat ggacaagttc 840 

aatgagcaag agatcaactt atcactggag gagctgaagg atcaacgcat gctgtctcgg 900 

tatgaaaaat gggagaaaat aaagcagcat taccagcatt ggagcgactc cctgtctgaa 960 

gagggtcgcg gcctcctgaa aaagctgcag attcctatcg agcctaaaaa agatgatata 102 0 

attcactcac tgtcccagga agagaaggag ctgcttaagc ggatccagat agattccagt 1080 

gacttcttaa gcacggaaga gaaggaattt ctgaaaaaat tgcagatcga tatccgtgat 1140 
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tccctgtcag 


aoQaoQaaaa 


aoaactocta aaccacrattc aaahoaat^G r*hnf*jia^r*r*a 




ctgtccgaaa 


aoQaQaacTcra 


oblitititiaaaQ aaaGtaaacrc licaataf^cca rrcct^accra^ 


■LAO W 


atcaatcaaa 


qorttcrcaoaa 


cacacicrcrcfora ttcrathoact caccc&a^fi^ caaotrt^aeiA^ 




QtaaQaaaqrc 


acrtacaaaco 


aaacatacacr a.ata.tcciafccf f*tGteGterr<a ^r*aaaeir*al*^ 


1 7 nn 

Xd DU 


gggt^ctacac 


bbbacaacaa 


oatttafctto tacaaaaaea tcraa^'a^naa f"aAt*^V^(TBr«r( 


1 AAA 


gccact^t:ag 


obcrcbo'atct 


CQhaoac^CC actaataaca r'aaacral'^AA f^nejeraana^^ 


1 tif\fi 
±3 UU 


ttcaatgagt 


licaaaaagaa 


t^tcaao^ah tctatfca&et cfcaaefcat-«^- aal-irrt-t-fTai- 




atcaacgaac 


gaccagccct 


acvacaacQaa cOQC^aaao^ ooecfciatT'ea <m^at"<^ar<r'rt 


XD^ V 


gacaccagag 


ctggcfcacct 


CCraaaatiaaa aaactcatt^c t*criaoaeTaaa hafr^rmxant-fr 


xoou 


gagataaagg 


acgttcaaali 


aataaaacaa acrcaaciaaacr aottaGa^saer aa^r*aa¥'t%r*^ 


X /4U 


aaggtcgtgc 


ccaaaagtaa 


cratiCQabacti aaQatitcaaci acrQct^Gaaf*!* £raaf'al~r>aal' 


1 linn 


cagaagtQQa 


ataacrcickct 


COOQCiltlCCC aaakacacaa aaa^a^^nan a^t^naa^^r^f- 


Xoou 


cataacagab 


acgcgtcbaa 


CatOatcaao fcCcaccrkatlG taatcofctaa ¥• as^ a aetata 
****'**Sll**^3*3 u\*wywyuaww u^ciuu>^uua<l v^cwlb^^&ag 


X74aU 


aacaacatcc 


acrbcccratcb 


GatcaaaaBB Gf^CAr*^aai^^ anl*f*an4'Bna ^m^ms9^#vab 
wawsi>na^e»att y bwawuaaci* cLB&v€Lg^S.ga ugggaaCgga 


J.90O 


cgctttgtgt 


ttacagatat 


cacactCCCa aacatCOCfco aacaat^atiac aeaGr^aacra^ 




gaaatctatg 


acicaaortorca 


wa.\j k.a.a«jy«j^ cu^uacgu^c cuga^a^bag aagcacccCw 


2x00 


ctgcacggcc 


ccagcaaagg 


cgtagagctt cgtaacgatt cggagggatt catacatgag 


2160 


tttgggcacg 


cagtcgacga 


ctatgccggt tatcttctcg acaaaaacca gagcgacttg 


2220 


gttaccaaca 


gtaagaaatt 


tatcgatatc ttcaaagaag agggctctaa cctaacatca 


2280 


tatggaagga 


ctaacgaggc 


agaattcttt gccgaagcct tccgcctgat gcattcaacc 


2340 


gaccacgcag 


agagactgaa 


agtgcagaaa aacgccccta agacttbcca gtttattaat 


2400 


gaccagatca 


Eiatttatcat 


caactct 


2427 



<210> 29 

<211> 26 

<212> Dim 

<213> Artificial Sequence 



<220> 

<223> Synthetic PGR primer 



<400> 29 

gagcttgata tcgccaccat ggatgc 



26 
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<210> 30 
<21X> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic PCR primer 
<400> 30 

ccaccaatat ccgatgcatg gacttccgc 29 

<210> 31 

<211> 28 

<212> DKTA 

<213> Artificial Sequence 
<220> 

<223> Synthetic PCR primer 

<400> 31 

cttgaaggat cctcaaccga tctcgtag 20 

<210> 32 

<211> 30 

<212> WA 

<213> Artificial Sequence 
<220> 

<223> Syntheric PCR Primer 

<400> 32 

ccatgcatcg gatattggtg gctccgtgtc 30 

<210> 33 

<211> 21 

<212> DNA 

<213> Artificisd. Sequence 
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<220> 

<223> Synthetic PCR primer 

<400> 33 

gtggacgacc aggaagtgat c 

<210> 34 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic PCR primer 

<400> 34 

ggctatctgt ccagtacagc ttgaa 

<210> 35 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic PCR primer 

<40Q> 35 

ccgtgctcgt tattcagagt 

<210> 36 

<2ai> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic PCR primer 

<400> 36 

ccttctcttc tgtgctaagg 



76- 



21 



25 



20 
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<210> 37 

<2X1> 34 

<212> DliQl 

<213> Artificial Sequence 
<220> 

<223> Synthetic PGR primer 

<400> 37 

gaacctggat ccctacacca ccttggcgtc gatg 34 

<2X0> 36 

<211> 35 

<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Synthetic PCR primer 

<400> 36 

gctaatggat cctcaaaatg ccttggcgaa cacct 35 

<210> 39 

<211> 876 

<212> D£IA 

<213> Artificial Sequence 
<220> 

<223> Synthetic coding region for Human/B. anthracis 
antigen fusion protein 

<220> 

<221> CDS 

<222> (13) . . (870) 

<223> 



<400> 39 
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gatatcgcca ce atg gat gca atg aag aga ggg etc tgc tgt gtg ctg ctg 51 

Met Asp Ala Met Lye Arg Gly Leu Cys Cys Val Leu Leu 

IS 10 

ctg tgt gga gca gtc ttc gtt teg ccc age gee ggc ggg cat ggg gac 99 
Leu Cys Gly Ala Val Phe Val Ser Pro Ser Ala Gly Gly His Gly Aap 
15 20 25 

gtt ggc atg cat gtg aaa gaa aag gag aaa aac aag gac gaa aac aag 147 
Val Gly Met His Val Lys Glu Lye Glu Lys Aaxx Lya Asp Glu Asn Lys 
30 35 40 45 

cgt aaa gac gaa gaa cgt aat aaa aca cag gag gaa cac tta aag gag 195 
Arg Lys Asp Qlu Glu Arg Asn Lys Thr Glu Glu Glu His Leu Lys Glu 
50 55 60 

ate atg aag cac ata gta aag att gag gta aaa ggc gaa gag get gta 243 
He Met Lys His lie Val Lys He Glu Val Lya Gly Glu Glu Ala Val 
65 70 75 

aag aag gag gca gca gaa aaa ctg ttg gag aag gtg cct tct gac gtc 291 
Lys Lys Glu Ala Ala Glu Lys Leu Leu Glu Lys Val Pro Ser Aap Val 
80 85 90 

tta gag atg tat aag gcc ate ggc ggt aag ate tat ate gtg gac gga 339 
Leu Glu Met Tyr Lys Ala He Gly Gly Lys He Tyr He Val Asp Gly 
95 100 105 

gac ate act aaa cac ata tct etc gaa get etc tec gag gac aag aaa 387 
Asp He Thx Lys His He Ser Leu Glu Ala Leu Ser Glu Asp Lys Lys 
110 115 120 125 

aag att aaa gac ate tac ggg aag gat gcc tta ttg cac gag cac tac 435 
Lys He Lys Asp He Tyr Gly Lys Asp Ala Leu Leu His Glu Hie Tyr 
130 135 140 

gtt tac gca aag gag ggc tat gag ccc gtg etc gtt att cag agt agt 483 
Val Tyr Ala Lys Glu Gly Tyr Qlu Pro Val Leu Val He Gin Ser Ser 
145 150 155 

gag gac tac gtc gag aat acc gag aaa get ctg aat gtg tat tac gag 531 
Glu Asp Tyr Val Glu Asn Thr Glu Lys Ala Leu Aan Val Tyr Tyr Glu 
160 165 170 

ate gga aag att ctg tec egg gac ate ctg tec aaa ate aac cag cca 579 
He Gly Lya He Leu Ser Arg Asp He Leu Ser Lya He Asn Gin Pro 
175 180 185 

tac cag aaa ttc ctt gat gtt ctt aac aca ate aaa aac gcg tea gat 627 
Tyr Gin Lys Phe Leu Asp Val Leu Asn Thr He Lys Asn Ala Ser Asp 
190 195 200 205 

age gac ggg cag gat ctt ctg ttt aca aat caa etc aag gaa cac ccc 675 
Ser Asp Gly Gin Asp Leu Leu Phe Thr Asn Gin Leu Lys Glu His Pro 
210 215 220 

act gat ttc age gtg gag ttc etc gag cag aat tct aac gaa gtc cag 723 
Thr Aap Phe Ser Val Glu Phe Leu Glu Gin Aan Ser Asn Glu Val Gin 
225 230 235 

gag gtg ttc gcc aag gca ttt gcg tac tat ate gaa ccc cag cat cgc 771 
Glu Val Phe Ala Lys Ala Phe Ala Tyr Tyr He Glu Pro Gin His Arg 
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240 245 250 

gat gtg etc cag ctg tac gcc ccg gag gca ttt aac tac atg gac aaa 819 
Asp Val Leu Gin Leu Tyr Ala Pro Glu Ala Phe Asa Tyr Met Asp hya 
255 260 265 

ttc aat gaa cag gag att aat ctg tot ctg gag gaa ctg aaa gac cag 867 
Phe Asn Glu Gin Glu lie Aan Leu Ser Leu Glu Glu Leu Lys Asp Gin 
270 275 280 285 

tga ggatcc 57 g 

<210> 40 

<211> 285 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Human/B, anthracia antigen fusion protein 
<400> 40 

Met Asp Ala Met Lya Arg Gly Leu Cya Cys Val Leu Leu Leu Cya Gly 
is 10 15 

Ala Val Phe Val Ser Pro Ser Ala Gly Gly His Gly Aap Val Gly Met 
20 25 30 

Hie Val Lys Glu Lya Glu Lys Asn Lys Asp Glu Asn Lys Arg Lys Asp 
35 40 45 

Qlu Glu Arg Asn Lys Thr Gin Glu Glu His Leu Lys Glu He Met Lys 
50 55 60 

His lie Val Lys He Glu Val Lys Gly Qlu Qlu Ala Val Lys Lys Glu 
^5 70 75 80 

Ala Ala Glu Lys Leu Leu Glu Lys Val Pro Ser Asp Val Leu Glu Met 
8S 90 95 

Tyr Lys Ala He Gly Gly Lys He Tyr He Val Asp Gly Asp He Thr 
100 105 110 

Lys His He Ser Leu Glu Ala Leu Ser Glu Asp Lys Lye Lys He Lys 
115 120 125 

Asp He Tyr Gly Lys Asp Ala Leu Leu His Glu His Tyr Val Tyr Ala 
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130 135 140 



Iiys Glu 61y Tyx Olu Pro Val Leu Val lie Oln Ser Ser Glu Asp Tyr 
145 150 155 160 



Val OXu Asn Thr Glu Lys Ala Leu Asn Val Tyr Tyx Glu lie Gly Lye 
165 170 175 



lie Leu Ser Arg Asp Zle Leu Ser Lye lie Asn Gin Pro Tyr Gin Lys 
180 185 190 



Phe Leu Asp Val Leu Asn Thr lie Lye Aan Ala Ser Asp Ser Aep Gly 
195 200 205 



Gin Asp Leu Leu Phe Thr Asn Gin Leu Lys Glu Hie Pro Thr Aep Phe 
210 215 220 



Ser Val Glu Phe Leu Glu Gin Asn Ser Asn Glu Val Gin Glu Val Phe 
225 230 235 240 



Ala Lye Ala Phe Ala Tyr Tyr lie Glu Pro Oln Hie Arg Asp Val Leu 
245 250 255 



Gin Leu Tyr Ala Pro Olu Ala Phe Asji Tyr Mat Asp Lye Phe Asn Glu 
260 265 270 



Gin Glu He Aan Leu Ser Leu Qlu Glu Leu Lys Asp Gin 
275 280 285 



<210> 41 

<211> 35 
<212> 

<213> Artificial Sequence 
<220> 

<223> Synthetic PGR primer 

<400> 41 

ccatacggat cctcactggt ctttcagttc ctcca 35 

<210> 42 

<211> 24 

<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Synthetic PCR primer 

<400> 42 

ctggagacac ctgtttatcg atcc 24 

<2X0> 43 

<211> 24 

<212> mh 

<213> Artificial Sequence 
<220> 

<223> Synthetic PCR primer 

<400> 43 

ggatcgataa acaggtgtct ccag 24 

<210> 44 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic PGR primer 

<400> 44 

gaagtactgg tctgtttaga tatggt 26 

<210> 45 

<211> 26 

<212> BNA 

<2a3> Artificial Sequenee 



<220> 

<223> synthetic PCR primer 
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<400> 45 

accatatcta aacagaccag tacttc 26 

<210> 46 

<211> 23 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Synthetic PGR primer 

<400> 46 

cgtcgaggac tggctattgc taa 23 

<210> 47 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic PGR primer 

<400> 47 

ttagcaatag ccagtcctcg acg 23 

<210> 48 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic PCR primer 

<400> 48 

gagggtctgc tgtttgccca gg 22 

<210> 49 

<211> 22 
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<2X2> DHA 

<2I3> Artificial Sequence 
<220> 

<223> Synthetic PGR primer 

<400> 49 

cctgggcaaa cagcagaccc tc 22 

<210> 50 

<21X> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic PGR primer 

<400> 50 

cttcagacca ctgtgaccca gtg 23 

<210> 51 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic PCR primer 

<400> 51 

cactgggtca cagtggtctg aag 23 

<210> 52 

<211> 21 

<212> DNA 

<2X3> Artificial Sequence 



<220> 
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<223> Synthetic PCR primer 

<400> 52 

gatcactggg ctgcacggcg g 

<210> 53 

<211> 21 

<212> DN% 

<213> Artificial Sequence 
<220> 

<223> Synthetic PCR primer 

<400> 53 

ccgccgtgca gcccagtgat c 

<210> 54 

<211> 22 

<212> mA 

<213> Artificial Sequence 
<220> 

<223> Synthetic PGR primer 

<400> 54 

tattggtggc ctgcagctct gc 

<210> 55 

<211> 22 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic PGR primer 

<400> 55 

gcagagctgc aggccaccaa ta 



-84. 



21 



21 



22 



<210> 56 
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<211> 24 
<212> Dm. 

<213> Artificial Sequence 

<22a> 

<223> Synthetic PGR primer 
<400> 56 

cagtactgct ctggataact tccc 

<210> 57 

<211> 24 

<2a2> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic PCR primer 

<400> 57 

gggaagttat ccagagcagt actg 

<210> 5B 

<2ia> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic PGR primer 

<400> 58 

aagctggaaa tctgcagcat at cat 

<210> 59 

<211> 25 

<212> DNA 

<213> Artificial Sequence 



85- 



24 



24 
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<220> 

<223> Synthetic PGR primer 

<400> 59 

atgatatgct gcagatttcc agctt 

<210> 60 

<211> 23 

<212> OKA 

<213> Artificial Sequence 
<220> 

<223> Synthetic PGR primer 

<400> 60 

ctcgcttggc tggatgattg tgt 

<210> 61 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic PCR primer 

<400> 61 

acacaatcat ccagccaagc gag 

<21D> 62 

<211> 24 

<212> mSA 

<213> Artificial Sequence 
<220> 

<223> Synthetic PGR primer 

<400> 62 

tcetgtgttt tctgacgttc ttcg 
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<210> 63 
<211> 24 
<212> WA 

<213> Artificial Sequence 
<220> 

<223> Synthetic PGR primer 
<400> 63 

cgaagaacgt cagaaaacac agga 

<210> 64 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic PGR primer 

<400> 64 

tatctgacgc ctgtttgatt gtgtt 

<210> 65 

<211> 25 

<212> DNR. 

<213> Artificial Sequence 
<220> 

<223> Synthetic PCR primer 

<400> 65 

aacacaatca aacaggcgtc agata 

<210> 66 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Synthetic PCR primer 

<400> 66 

ccagagacag ctgaatctcc tgtt 

<210> 67 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic VCR primer 

<400> 67 

aacaggagat tcagctgtct ctgg 

<210> 68 

<211> 25 

<212> DKA 

<213> Artificial Sequence 
<220> 

<223> Synthetic PCR primer 

<400> 6B 

agcggtgagc tggttaatat tcatg 

<2ID> 69 

<211> 25 

<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Synthetic PGR primer 

<400> 69 

catgaatatt aaccagctca ccgct 
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<210> 70 

<211> 23 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic PCR primer 

<400> 70 

cctcagaatc ctgtcgaagc tea 23 

<210> 71 

<211> 23 

<212> mA 

<213> Artificial Sequence • 
<220> 

<223> Synthetic PCR primer 

<400> 71 

tgagcttcga caggattctg agg 23 

<210> 72 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic PGR primer 

<4O0> 72 

gatcagactg ctgcttatcc aaca 24 

<210> 73 

<211> 24 

<212> DHA 

<213> Artificial Sequence 
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<220> 

<223> Synthetic FCR primer 

<400> 73 

tgttggataa gcagcagtct gate 

<210> 74 

<211> 23 

<212> DNA 

<213> Airtificial Sequence 
<220> 

<223> Synthetic PGR primer 

<400> 74 

aggaagtcag ctgactccct tec 23 

<210> 75 

<211> 23 

<212> DiaA 

<213> Artificial Sequence 
<220> 

<223> Synthetic PGR primer 

<400> 75 

ggaagggagt cagctgactt cct 23 

<210> 76 

<211> 21 
<212> 

<213> Artificial Sequence 
<220> 

<223> Synthetic PGR primer 

<4C0> 76 

gcagatctgg atcctcaaga g 21 
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